< 

oo 



On a Greedy 2-Matching Algorithm and Hamilton Cycles in 
Random Graphs with Minimum Degree at Least Three. 



Alan Frieze* 
, Department of Mathematical Sciences, 

^ I Carnegie Mellon University, 

bjQ' Pittsburgh PA15217. 



August 9, 2011 



o. 

^ . Abstract 

r^ ' We describe and analyse a simple greedy algorithm 2greedy that finds a good 2-matching 

C^ . M in the random graph G = G^^"^ when c > 15. A 2-matching is a spanning subgraph of 

maximum degree two and G is drawn uniformly from graphs with vertex set [n], en edges and 

minimum degree at least three. By good we mean that M has O(logn) components. We then 

fT^ . use this 2-matching to build a Hamilton cycle in 0(n^'''+°'^^') time w.h.p.. 

> 

Q\^ '. 1 Introduction 

^, 

r^ I There have been many papers written on the existence of Hamilton cycles in random graphs. 

Komlos and Szemeredi [T7], Bollobas [S], Ajta, Komlos and Szemeredi pQ showed that the question 
is intimately related to the minimum degree. Loosely speaking, if we are considering random graphs 
with n vertices and minimum degree at least two then we need ^(nlogn) edges in order that they 

^ . are likely to be Hamiltonian. 

C^ ' For sparse random graphs with 0{n) random edges, one needs to have minimum degree at least 

three. This is to avoid having three vertices of degree two sharing a common neighbour. There 
are several models of a random graph in which minimum degree three is satisfied: Random regular 
graphs of degree at least three, Robinson and Wormald [21], [22] or the random graph G^-out, 
Bohman and Frieze [5]. Bollobas, Cooper, Fenner and Frieze [7] considered the classical random 
graph Gn,m with conditioning on the minimum degree k i.e. each graph with vertex set [n] and 
m edges and minimum degree at least k is considered to be equally likely. Denote this model of 
a random graph by G^J^. They showed that for every A; > 3 there is a c^ < (A; -|- 1)^ such that 
if c > Cfc then w.h.p. Gf^J^ has {k — l)/2 edge disjoint Hamilton cycles, where a perfect matching 
constitutes half a Hamilton cycle in the case where k is even. It is reasonable to conjecture that 
Ck = k/2. The results of this paper and a companion J14j reduce the known value of C3 from 64 to 
below 15. It can be argued that replacing one incorrect upper bound by a smaller incorrect upper 
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bound does not constitute significant progress. However, the main contribution of this paper is to 
introduce a new greedy algorithm for finding a large 2-matching in a random graph and to give a 
(partial) analysis of its performance and of course to apply it to the Hamilton cycle problem. 

One is interested in the time taken to construct a Hamilton cycle in a random graph. Angluin and 
Valiant [2] and Bollobas, Fenner and Frieze [8] give polynomial time algorithms. The algorithm in 
[2] is very fast, 0(n log n) time, but requires Knlogn random edges for sufficiently large K > 0. 
The algorithm in [8] is of order n^~^°^^' but works w.h.p. at the exact threshold for connectivity. 
Frieze |12j gave an 0{n"^~^°^^') time algorithm for finding large cycles in sparse random graphs and 
this could be adapted to find Hamilton cycles in G^^^ in this time for sufficiently large c. Another 
aim of [13] and this paper is reduce this running time. The results of this paper and its companion 
[14| will reduce this to fii-5+o(i) for sufficiently large c, and perhaps in a later paper, we will further 
reduce the running time by borrowing ideas from a linear expected time algorithm for matchings 
due to Chebolu, Frieze and Melsted [TOl. 



The idea of [10] is to begin the process of constructing a perfect matching by using the Karp-Sipser 
algorithm [16] to find a good matching and then build this up to a perfect matching by alternating 
paths. The natural extension of this idea is to find a good 2-matching and then use extension- 
rotation arguments to transform it into a Hamilton cycle. A 2-matching M of G is a spanning 
subgraph of maximum degree 2. Each component of M is a cycle or a path (possibly an isolated 
vertex) and we let k{M) denote the number of components of M. The time taken to transform 
M into a Hamilton cycle depends heavily on k{M). The aim is to find a 2-matching M for which 
k{M) is small. The main result of this paper is the following: 

Theorem 1. There is an absolute constant cq > and such that if c > cq then w.h.p. 2GREEDY 
finds a 2-matching M with k{M) = O(logn). (This paper gives an analytic proof that cq < 15. We 
have a numerical proof that cq < 2.5 j. 



Given this theorem, we will show how we can use this and the result of J14j to show 
Theorem 2. If c> cq then w.h.p. a Hamilton cycle can be found in 0{n^'^~^°^^') time. 

Acknowledgement: I would like to thank my colleague Boris Pittel for his help with this paper. 
He ought to be a co-author, but he has declined to do so. 

2 Outline of the paper 

As already indicated, the idea is to use a greedy algorithm to find a good 2-matching and then 
transform it into a Hamilton cycle. We will first give an over-view of our greedy algorithm. As 
we proceed, we select edges to add to our 2-matching M. Thus M consists of paths and cycles 
(and isolated vertices). Vertices of the cycles and vertices interior to the paths get deleted from 
the current graph, which we denote by F. No more edges can be added incident to these interior 
vertices. Thus the paths can usefully be thought of as being contracted to the set of edges of a 
matching M* on the remaining vertices of F. This matching is not part of F. We keep track of the 
vertices covered by M* by using a 0/1 vector b so that for vertex v, b{v) is the indicator that v is 



covered by M* . Thus when v is still included in F and h{v) = 1, it will be the end-point of a path 
in the current 2-niatching M. 

The greedy algorithm first tries to cover vertices of degree at most two that are not covered by 
M or vertices of degree one that are covered by M. These choices are forced. When there are no 
such vertices, we choose an edge at random. We make sure that one of the end-points u, v of the 
chosen edge has 6-value zero. The aim here is to try to quickly ensure that h{v) = 1 for all vertices 
of r. This will essentially reduce the problem to that of finding another (near) perfect matching in 
r. The first phase of the algorithm finishes when all of the vertices that remain have 6-value one. 
This necessarily means that the contracted paths form a matching of the graph T that remains 
at this stage. Furthermore, we will see that F is distributed as G^^ for some z/, /i and then we 
construct another (near) perfect matching M** of F by using the linear expected time algorithm 
of [To]. We put M and M** together to create a 2-matching along with the cycles that have been 
deleted. Note that some vertices may have become isolated during the construction of M and these 
will form single components of our 2-matching. The union of two random (near) perfect matchings 
is likely to have O(logn) components. Full details of this algorithm are given in Section HI 

Once we have described the algorithm, we can begin its analysis. We first describe the random 
graph model that we will use. We call it the Random Sequence model. It was first used in Bollobas 
and Frieze [9] and independently in Chvatal [11] . We used it in [3] for our analysis of the Karp- 
Sipser algorithm. We prove the truncated Poisson nature of the degree sequence of the graph F that 
remains at each stage in Section O We then, in Section lU give a detailed description of 2greedy. 
In Section [5] we show that the distribution of the evolving graph F can be succinctly described 
by a 6-component vector v = {yi,y2,zi,y, z, fi) that evolves as a Markov chain. Here yj,j = 1,2 
denotes the number of vertices of degree j that are not incident with M and zi denotes the number 
of vertices of degree one that are incident with M. y denotes the number of vertices of degree at 
least three that are not incident with M and z denotes the number of vertices of degree at least two 
that are incident with M. fi denotes the number of edges. It is important to keep C = 2/i + 2/2 + -^i 
small and 2greedy will attempt to handle such vertices when ^ > 0. In this way we keep C small 
w.h.p. throughout the algorithm and this will mean that the final 2-matching produced will have 
few components. Section [6] first describes the (approximate) transition probabilities of this chain. 
There are four types of step in 2greedy that depend on which if any of yi,y2, z\ are positive. Thus 
there are four sets of transition probabilities. Given the expected changes in v, we first show that 
in all cases the expected change in ^ is negative, when (^ is positive. This indicates that C will not 
get large and a high probability polylog bound is proven. 

We are using the differential equation method and Section [7] describes the sets of differential equa- 
tions that can be used to track the progress of the algorithm w.h.p.. The parameters for these 
equations will be v = {yi,y2, z, y, z, fi). There are four sets of equations corresponding to the four 
types of step in 2greedy. It is important to know the proportion of each type of step over a small 
interval. We thus consider a sliding trajectory i.e. a weighted sum of these four sets of equations. 
The weights are chosen so that in the weighted set of equations we have y'l = y'2 = z'^ = Q. This 
is in line with the fact that y,z, fi ^ C for most of the algorithm. We verify that the expressions 
for the weights are non-negative. We then verify that w.h.p. the sliding trajectory and the process 
parameters remain close. 



Our next aim is to show that w.h.p. there is a time T such that y{T) = 0,z{T) = 0,{n). It 
would therefore be most natural to show that for the sliding trajectory, there is a time T such 
that y{T) = 0, z{T) = Q(n). The equations for the sliding trajectory are complicated and we have 
not been able to do this directly. Instead, we have set up an approximate system of equations (in 
parameters y, z, jl) that are close when c > 15. We can prove these parameters stay close to y, z, ji 
and that there is a time T such that y{T) = 0, z{T) = J7(n). The existence of T is deduced from 
this and then we can deduce the existence of T. We then in Section [9] show that w.h.p. 2greedy 
creates a matching with O(logn) components, completing the proof of Theorem [TJ 

Section [10] shows how to use an extension-rotation procedure on our graph G to find a Hamilton 
cycle within the claimed time bounds. This procedure works by extending paths one edge at a time 
and using an operation called a rotation to increase the number of chances of extending a path. It 
is not guaranteed to extend a path, even if it is possible some other way. There is the notion of a 
booster. This is a non-edge whose addition will allow progress in the extension-rotation algorithm. 
The companion paper [14] shows that for c > 2.67 there will w.h.p. always be many boosters. To 
get the non-edges we first randomly choose s = v}'"^ log" n random edges X of G, none of which 
are incident with a vertex of degree three. We then write G = G' + X and argue in Section 110.11 
that the pair {G\X) can be replaced by {H,Y) where H = G^-^.^ and F is a random set of 
edges disjoint from E{H). We then argue in Section [10.31 that w.h.p. Y contains enough boosters 
to create a Hamilton cycle within the claimed time bound. 

Section [TT] contains some concluding remarks. 

3 Random Sequence Model 

A small change of model will simplify the analysis. Given a sequence x = xi, X2, . . . , X2m G [n]^^ 
of 2M integers between 1 and N we can define a (multi)-graph Gx = Gx(-/V, M) with vertex set 
[N] and edge set {(x2i-i, X2i) : 1 < i < M}. The degree dx(^) of f € [A^] is given by 

d^{v) = \{3e[2M]:xj = v]\. 

If X is chosen randomly from [A]^ then Gx is close in distribution to Gn,m- Indeed, conditional 
on being simple, Gx is distributed as Gi\i,M- To see this, note that if Gx is simple then it has vertex 
set [N] and M edges. Also, there are M!2 distinct equally likely values of x which yield the same 
graph. 

Our situation is complicated by there being lower bounds of 2,3 respectively on the minimum 
degree in two disjoint sets J2, J3 ^ [A]. The vertices in Jq = [A] \ J2 U J3 are of fixed degree 
bounded degree and the sum of their degrees \s D = o{N). So we let 

[N?j^'j^;D = {x G [Nf'' : d^{j) > i for j eJ„i = 2,3 and J^ d^{j) = D}. 

jeJo 

Let G = G{N, M, J2, J3; D) be the multi-graph Gx for x chosen uniformly from [Ajj^^j .j-,. It is 
clear then that conditional on being simple, G(n, ?7i, 0, [n];0) has the same distribution as G^^. 
It is important therefore to estimate the probability that this graph is simple. For this and other 



reasons, we need to have an understanding of the degree sequence dx when x is drawn uniformly 
from [Nfj^lj^.^j,. Let 

i=0 

for fc > 0. 

Lemma 3.1. Let x be chosen randomly from [-/V] j^^j .£)• For i = 2, 3 let Zj (j G [Jj]) 6e independent 
copies of a truncated Poisson random variable Vi, where 

m = t) = ^y t = M + l,.... 

Here A satisfies 

For j G Jo, Zj = dj is a constant and X^j-gj^, dj = D. Then {'ix(j)}je[Ar] is distributed as {Zj}jg[7v] 
conditional on Z = ^,g[„i Zj = 2M . 

Proof Note first that the value of A in ([T|) is chosen so that 

E(Z) = 2M. 

Fix Jo, J2, J3 and ^ = (^1, ^2, • • • , ^n) such that ^j = dj for j G Jq and ^j > A; for A; = 2, 3. Then, 



xG[Af]2« 



On the other hand, 

(Zi, Z2,. . . ,Zn) 












'J2^J3;o 



,^e[N]f' 



P(dx = ^). 



D 

To use Lemma 13.11 for the approximation of vertex degrees distributions we need to have sharp 
estimates of the probability that Z is close to its mean 2M. In particular we need sharp estimates 
of P(Z = 2M) and P(Z — Z\ = 2M — k), for k = o{N). These estimates are possible precisely 



because E(Z) = 2M. Using the special properties of Z, we can refine a standard argument to show 
(Appendix 1) that where N^ = | J^l and N* = N2 + N^ and the variances are 

Oe = „ ,. ,„ = A— - 



/£(A)2 d\ V /£(A) 

anda2 = -i-^iV,a| (2) 



3 
that if iVV^ -> 00 and A: = 0{\fN*a) then 



P(Z = 2M-fc)= i ('i + o('^^^y (3) 

aV2^^ V \N*a^ ) ) 

A proof for J2 = [N] was given in the appendix of [3]. We need to modify the proof in a trivial 
way. Given ^ and 

al = 0{\), ^ = 2,3, 

we obtain 

Lemma 3.2. Zet x 6e chosen randomly from [N]jj .j^. 

(a) Assume that \ogN* = 0{{N*\Y/'^). For every j G Je and i <k < logN*, 

Furthermore, for all ^1,^2 G {2, 3} and ji G Jei,J2 £ <^^2> ii / i2i a^c^ h l^ki < logiV*, 

(b) 

<i.O)<-Jp^ ,.0 (6) 

(log log A*) ^/^ 

/or a// j G J2 U J3 . 
Proof Assume that j = 1 ^ Jq. Then 



P(dx(l) = A:) 



Zi = k and ^^^^ Z^ = 2M 






^;l2 Z, = 2M - k 



Likewise, with ji = 1,J2 = 2, 

A^'i Xk2 IP (E^s ^i = 2M - fci - k2) 



¥{d^{l) = ki,d^{2) = k2) 



^An event £^ — £{N*) occurs quite surely (q.s., in short) if F{£) = 1 — 0(N ") for any constant a > 



kilfe,{X) k2\fe,{X) p (^y^^i Z^ = 2M 



Statement (a) follows immediately from ([3]) and (b) follows from simple estimations. D 

Let i^^is) denote the number of vertices in J^,£ = 2,3 of degree s in Gx- Equation i^ and a 
standard tail estimate for the binomial distribution shows 

Lemma 3.3. Suppose that log A^* = 0{{N*X)^/'^) and A'^ — )• oo with N . Let x be chosen randomly 
from [A^ljjfjg./). Then q.s., 



P(x) 






j!/(A) 



NeX^ V^"^^ 



fw) 



< 1 + -V77^ log N, k<j<logN}. (7) 



D 

We can now show G^, x G ["■Ifl'uiio ^^ ^ good model for G„^. For this we only need to show now 
that 

P(Gx is simple) = J^(1). (8) 

For this we can use a result of McKay [19]. If we fix the degree sequence of x then x itself is 
just a random permutation of the multi-graph in which each j G [n] appears dx(i) times. This 
in fact is another way of looking at the Configuration model of Bollobas [S]. The reference |19j 
shows that the probability Gx is simple is asymptotically equal to e"^^''""'^^^^'''''"^^ where p = m2/m 
and m,2 = Y2je\n\ '^x(j)('^x(j) — !)■ One consequence of the exponential tails in Lemma [STSl is that 
1712 = 0{m). This implies that p = 0(1) and hence that ([8]) holds. We can thus use the Random 
Sequence Model to prove the occurrence of high probability events in G^^. 

With this in hand, we can now proceed to describe our 2-matching algorithm. 

4 Greedy Algorithm 

Our algorithm will be applied to the random graph G = G„^ and analysed in the context of Gx. 
As the algorithm progresses, it makes changes to G and we let T denote the current state of G. 
The algorithm grows a 2-matching M and for v G [n] we let h[v) be the 0/1 indicator for vertex v 
being incident to an edge of M. We let 

• /i be the number of edges in F, 

• Voj = {ve [n] : dv{v) = 0, h{v) = j}, j = 0, 1, 

• Yk = {ve [n] : dr{v) = k and b{v) = 0}, /c = 1, 2, 

• Zi = {v £ [n] : dr{v) = 1 and b{v) = 1}, 

• Y = {v £ [n] : dr{v) > 3 and b{v) = 0}, This is J3 of Section [3l 

• Z = {v £ [n] : dr{v) > 2 and b{v) = 1}, This is J2 of Section [3l 

• M is the set of edges in the current 2-matching. 

• M* is the matching induced by the path components of M i.e. if P C Af is a path from x to 
y then (x, y) will be an edge of M* and the internal edges of P will have been deleted from 

r. 

7 



Observe that the sequence b = {b{v)) is determined by Vo,0) Vo,i, Yi, I21 Zi, Y, Z. 

If Yi 7^ then we choose i; G Yi and add the edge incident to f to M, because doing so is not a 
mistake i.e. there is a maximum size 2-matching of T that contains this edge. If li =0 and I2 7^ 
then we choose v G Y2 and add the two edges incident to v to M, because doing so is also not a 
mistake i.e. there is a maximum size 2-matching of T that contains these edges. Similarly, if 12 = 
and Zi 7^ we choose v £ Zi and add the unique edge of T incident to v to M. When we add an 
edge to M it can cause vertices of T to become internal vertices of paths of M and be deleted from 
r. In particular, this happens to v £ Zi in the case just described. When Yi = Y2 = Zi = f/i ^ Y we 
choose a random edge incident to a vertex of y. In this way we hope to end up in a situation where 
Y2 = Zi = Y = fji and \Z\ = 0(n). This has advantages that will be explained later in Section [9] and 
we have only managed to prove that this happens w.h.p. when c > 15. When Yi = Y2 = Zi = Y = (/> 
we are looking for a maximum matching in the graph F that remains and we can use the results of 

m- 

We now give details of the steps of 

Algorithm 2GREEDY: 

Step 1(a) Yi /0 

Choose a random vertex v from Yi. Suppose that its neighbour in T is w. We add {v, w) to 
M and move v to Vo,i. 

(i) If b{w) = then we add {v, w) to M* . If w is currently in Y then move it to Z. If it is 
currently in Yi then move it to Vo,i. If it is currently in Y2 then move it to Zi. Call this 
re- assigning w. 

(ii) If b{w) = 1 let li be the other end point of the path P of M that contains w. We remove 
{w,u) from M* and replace it with {v,u). We move w to Vo,i and make the requisite 
changes due to the loss of other edges incident with w. Call this tidying up. 

Step 1(b): Fi = and I2 / 

Choose a random vertex v from Y2. Suppose that its neighbours in F are wi,W2- 

II wi = W2 = V then we simply delete v from F. (We are dealing with loops because we are 
analysing the algorithm within the context of Gx. This case is of course unnecessary when 
the input is simple i.e. for G„-^). 

Continuing with the most likely case, we move v to Vo,i. We delete the edges 
(v, wi), {v, W2) from F and place them into M. In addition, 

(i) If b{wi) = b{w2) = then we add (loi, W2) to M* and put b{wi) = b{w2) = 1. Re-assign 

Wi,W2. 

(ii) If b{wi) = b{w2) = 1 let ni,i = 1,2 be the other end points of the paths -Pi,i-*2 of M 
that contain wi.,W2 respectively. There are now two possibilities: 

(1) Ml = W2 and M2 = wi. In this case, adding the two edges creates a cycle C = 
{v,wi,Pi,W2,v) and we delete the edge {wi,W2) from M* . Vertices wi,W2 are 
deleted from F. The rest of C has already been deleted. Tidy up. 



(2) ui 7^ W2 and U2 ^ wi. Adding the two edges creates a path 

{ui, P[,wi,v,W2, P2,U2) to M, where P[ is the reversal of Pi. We delete the edges 
(wi,ui), {w2,U2) from M* and add (ui, U2) in their place. Vertices wi,W2 are deleted 
from r. Tidy up. 

(iii) If b{wi) = and b{'W2) = 1 let U2 be the other end point of the path P2 of Af that 
contains W2- We delete {w2,U2) from M* and replace it with {wi,U2)- We put b{wi) = 1 
and re-assign it and delete vertex -012 from F. Tidy up. 

Step 1(c): ¥2 = ^ and Zi / 

Choose a random vertex v from Zi. Let u be the other endpoint of the path P of M that 
contains v. Let w be the unique neighbour of v in F. We delete u from F and add the edge 
{v, w) to M. In addition there are two cases. 

(1) If b{w) = then we delete {v,u) from M* and replace it with {w,u) and put b{w) = 1 
and re- assign w. 

(2) If b{w) = 1 then let u be the other end-point of the path containing w in M. If u ^ v 
then we delete vertex w and the edge {u,w) from M* and replace it with {u,v). Tidy 
up. If n = f then we have created a cycle C and we delete it from F as in Step l(b)(i)(l). 

Step 2: y^ = ^2 = Zi = and y / 

Choose a random edge {v, w) incident with a vertex v £Y. We delete the edge {v, w) from F 
and add it to M. We put b{v) = 1 and move it from Y to Z. There are two cases. 

(i) If b{w) = then put b{w) = 1 and move it from Y to Z. We add the edge {v, w) to M* . 

(ii) If b{w) = llet u be the other end point of the path in M containing w. We delete vertex 
w and the edge {u,w) from M* and replace it with {u,v). Tidy up. 

Step 3: Yi = Y2 = Zi = Y = ^ 

At this point F will be seen to be distributed as G^^^ for some u, /U where ^ = 0{v). As such, 
it contains a (near) perfect matching M** [T3] and it can be found in 0{v) expected time 

m. 



The output of 2GREEDY is set of edges in M U M** . 

No explicit mention has been made of vertices contributing to Vo,o- When we we tidy up after 
removing a vertex u;, any vertex whose sole neighbour is w will be placed in Vo,o- 

5 Uniformity 

In the previous section, we described the action of the algorithm as applied to F. In order to prove 
a uniformity property, it is as well to consider the changes induced by the algorithm in terms of x. 

When an edge is removed we will replace it in x by a pair of •'s. This goes for all of the edges 
removed at an iteration, not just the edges of the 2-matching M. Thus at the end of this and 
subsequent iterations we will have a sequence in A = ([n] U {:*r})^'" where for all i, X2i-i = * if and 
only if X2i = *■ We call such sequences proper. 

9 



We use the same notation as in Section [3j Let S = S{x.) = {i : Z2i-i = Z2i = *}. Note that the 
number of edges /i in Gx is given by 

H = m — \S\. 

For a tuple v = (Vo,Oi Vo,i, Yi, Y2, Zi,Y, Z, S) we let Av denote the set of pairs (x, b) where x £ A 
is proper and 

. Voj = {ve [n] : d^{v) = 0, b{v) = j}, j = 0, 1, 

• Yk = {v £ [n] : dy^{v) = k and b{v) =0}, k= 1,2, 

• Zi = {v € [n] : (ix(^) = 1 and b{v) = 1}, 

• Y = {ve[n]: d^{v) > 3 and b{v) = 0}, 

• Z = {ve[n]: d,,{v) > 2 and b{v) = 1}. 

• S = S{x). 

(Re-call that b is determined by v). 

For vectors x, b we define v(x, b) by (x, b) G A^^^.b)- We also use the notation x G Ay/^) when 
the second component b is assumed. 

Given two sequences x, x' G A, we say that x' C x if Xj = -k implies x'- = *. In which case we define 
y = X - x' by 

{Xj If Xj ^ -k = x'- 
•k Otherwise 
Thus y records the changes in going from x to x'. 

Given two sequences x, x' G A we say that x, x' are disjoint if Xj 7^ • implies that x'- = •. In which 
case we define y = x + x' by 



Vj 



If x'j / • 
Otherwise 



x'j If x'j / • 



Thus, 

if x' C X then x' and x — x' are disjoint and x = x' + (x — x'). (9) 

Suppose now that (x(0), b(0)), (x(l), b(l)), . . . , (x(t), b(t)) is the sequence of pairs representing the 
graphs constructed by the algorithm 2GREEDY. Here x(i — 1)5 x(z) for i > 1 and so we can define 
y{i) = x{i - 1) - x(i). Suppose that v(i) = v(x(z)) for 1 < i < f where v(0) = (0, 0, 0, 0, [n], 0, 0) 
and b(0) = 0. 

Let 

Av|b = {x:(x,b)GAv}. 

Lemma 5.1. Suppose that x(0) is a random member of Ay,fQ\ii,tQ\ . Then given 
v(0), v(l), . . . ,v(t), the vector x(t) is a random member 0/ Av(4)|b(4) for all t > 0, that is, the 
distribution ofx{t) is uniform, conditional on the edges deleted in the first t steps. (Note that b(f) 
is fixed by v(t) here). 

10 



Proof We prove this by induction on t. It is trivially true for t = 0. Fix t > 0, x(t), b(t), x(t + 

1), b(t + 1). We define a sequence x(t) = zi, Z2, . . . , z^ = x(t + 1) where z^+i is obtained from Zj 
by a basic step 

Basic Step: Given x, b and v = v(x, b) we create new sequences x' = Aj{x),h' = Bj{h) and 
v' = v(x',b'). Let w = X — x'. A basic step corresponds to replacing the edge {w2j~i,W2j) by an 
edge of the matching M, for some index j. Let u = W2j-\,v = W2j- 

Case 1: Here we assume b{u) = b{v) = 0. 

Replace X2j~i,X2j by Vs and put b{u) = b{v) = 1. 

Case 2: Here we assume b{u) = 0, b{v) = 1. 

Replace X2k-i,X2k by Vs for every k such that v G {x2k-iTX2k} and put b{u) = 1. 

Case 3: Here we assume b{u) = b{v) = 1. 

Replace W2k-i,'W2k by •'s for every k such that {u,v} n {w2k-i,W2k} / O- 

Claim 2.1. Suppose that x' = Aj{x) and y = x — x' and W = Bj(h). Then the map </> : z S 

A^. , , — 7- (z — y, b') is 1-1 and each (z',b') E Av(x',b') ^■^ ^^e image under (j) of a unique member 
o/^v(x,b)' «^^ere A^(x,b) = K^'b) S ^v(x,b) : z 5 y}- 

Proof of Claim 12.11 Equation ^ implies that (f) is 1-1. Let v = v(x, b) and v' = v(x',b'). 
Choose (w,b') G Av'. Because S' is determined by v', we see that y and w are necessarily disjoint 
and we simply have to check that if x* = w + y then (x*,b) G Av But in all cases, v(x*,b) is 
determined by v' and y and this implies that v(x*,b) = v(x, b). 

This statement is the crux of the proof and we should perhaps justify it a little more. Suppose 
then that we are given v' (and hence b') and y and b. Observe that this determines d-^*{v) for all 
V G VqqUVqiUYIuY2^ Z'^. Together with b{v) this determines the place of v in the partition 
defined by v. Now y' C y and it only remains to deal with v G Z' . If dy{v) > then v £ Y L) Z 
and b{v) determines which of the sets v is in. If dy{v) = and b{v) = 1 then v £ Z. If dy{v) = 
and b{v) = then v £ Y. This is because b{v) = and b'{v) = 1 implies that we have put one of 
the edges incident with v into M. 
End of proof of Claim 12.11 

The claim implies (inductively) that if x is a uniform random member of A^iy^ and we do a sequence 
of basic steps involving the "deletion" of yi, y2, . . . , ys where y^+i ^ x — yi — • • • y^, then x' = 
X — yi — • • • — y^ is a uniform random member of A^/ib') where v' = v(x', b') for some b'. This will 
imply Lemma |5. II once we check that a step of 2greedy can be broken into basic steps. 

First consider Step 1(a). First we choose a vertex in x G Yi. Then we apply Case 1 or 2 with 
probabilities determined by v. 

Now consider Step 1(b). First we choose a vertex in x G l2- We can then replace one of the edges 
incident with a; by a matching edge. We apply Case 1 or Case 2 with probabilities determined by 
V. After this we apply Case 2 or Case 3 with probabilities determined by v. 

For Step 1(c) we apply one of Case 2 or Case 3 with probabilities determined by v. 

For Step 2, we apply one of Case 1 or Case 2 with probabilities determined by v. 
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This completes the proof of Lemma I5.1[ D 

As a consequence 

Lemma 5.2. The random sequence v(t), t = 0,1,2, ..., is a Markov chain. 
Proof Shghtly abusing notation, 

P(v(t + l)|v(0),...,v(t)) 
= Yl lP(w'|v(0),...,v(t)) 

= E E IP(w',w|v(0),...,v(t)) 

= E E IP(w'|v(0),...,v(t-l),w)P(w|v(0),...,v(t)) 

x'eAv(t+i) weAv(t) 

= y^ y^ P(w' I w)|Av(()|^ , using Lemma \5A[ 

w'GAv(t+i) weAv(t) 

which depends only on v(t), v(t + 1). D 

We now let 

|v| = {|Vo,o|, |l^o,il> l^il, l>2|, \Zi\, \Y\, \Z\, \S\} . 

Then we let A|y| denote the set of (x, b) G A with |v(x, b)| = |v| and we let 

A|v||b = {x : (x,b) G A|v|}. 

It then follows from Lemma 15.21 that by symmetry, 

Lemma 5.3. The random sequence |v(t)|, t = 0,1,2, ..., is a Markov chain. 

A component of a graph is trivial if it consists of a single isolated vertex. 

Lemma 5.4. Whp the number of non-trivial components of the graph induced by M D M** is 
O(logn). 

Proof Lemma 3 of Frieze and Luczak |13j proves that w.h.p. the union of two random (near) 

perfect matchings of [n] has at most 3 log n components. Lemma 15.11 implies that at the end of 
Phase 1, r is a copy of G]^^ , independent of M* . In which case the (near) perfect matching of F 
is independent of M* and we can apply |13] . D 



6 Conditional expected changes 

We now set up a system of differential equations that closely describe the path taken by the 
parameters of Algorithm 2greedy, as applied to Gx where x is chosen randomly from [ji-JffllJji.n- 
We introduce the following notation: At some point in the algorithm, the state of F is described 
by X G Wlj^2^j'.-D^ together with an indicator vector b. We let yi = \{v : dx(f) = i and b{v) = 0}| 
and let Zi = \{v : dx(t') = i and b{v) = 1}| for i > 0. We let y = X^j>3 Vi and z = X^j>2 ^i and let 
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2^ = Yli>o'''iyi + ^i) be the total degree. Thus in the notation of Section U] we have yi = \Yi\,i = 
1, 2, J3 = Y, N^ = y, zi = \Zi\, J2 = Z, N2 = z, D = Hi + 2y2 + zi, M = fi. Then it follows from 
Lemma [STSl that as long as [y + z)X = il(log n), we have q.s., 

Here A is the root of 

A/2(A) , A/i(A) 

n 

Notational Convention: There are a large number of parameters that change as 
2GREEDY progresses. Our convention will be that if we write a parameter ^ then by default it 
means i{t), the value of £, after t steps of the algorithm. Thus the initial value of ^ will be ^(0). 
When ^ is evaluated at a different point, we make this explicit. 

We now keep track of the expected changes in v = {yi,y2, y, zi, Z2, fi) due to one step of 2greedy. 
These expectations are conditional on the current values of b and the degree sequence d. We let 
N = y + z, which is a small departure from the notation of Section [3l In the following sequence 
of equations, S,' = ^(t + 1) represents the value of parameter ^ after the corresponding step of 
2GREEDY. 

Lemma 6.1. The following are the expected one step changes in the parameters 
iyi:y2,y,zi,z, fi). We will compute them conditional on the degree sequence d and on |v|. We give 
both, because the first are more transparent and the second are what is needed. The error terms e? 
are the consequence of multi-edges and we will argue that they are small. We take 

N = y + z. 

Step 1. yi + y2 + zi > 0. 
Step 1(a). yi > 0. 



.,,_,,„,<, . _(M,j;|.„_,|iUE§(«^-i)t.g (14) 



fc>2 "^ '^ / fc>2 



Ete-»l|v|] = ---—^^^ + —^^^^^^ + 0[^^^] (15) 
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E[4 - ^1 I b, d] 
EK - ^1 I |v|] 



zx yr^kzk 1x^1 \ , sr^kzk 2z2 



2/. ^^ 2// 2/. 



fc>2 



2/x 



2/x 



zi ziz AVo(A) ^ z^ AVo(A) ^ ^^log^JV 



2fi V /2(A) V /2(A)2 



XN 



(16) 
(17) 



E[y'-y|b,d] 
Eb' - 2/ I |v|] 



y^ + y^ik-1)^ 

^^ 2u -^ 2u ^ ^ 2/x 

^A;>3 ^ k>2 ^ ^ 

y A/2 (A) yz A^ AVo(A) 
2/i /3(A) 8/^2 /3(A) /2(A) 



+ 123 

log^TV 



+ 



XN 



(18) 
(19) 



E[z'-z I b,d] 
E[z' - z I |v|] 



Ekyk s^kzk sr^kzk .2z2 , 

?/ A/2(A) _ z A/i(A) _ ^ AVo(A) / log^JV 

2^ /3(A) 2^ /2(A) V /2(A)2 + V AiV 



(20) 
(21) 



fc>2 



E[/.'-^|b,d] = -1-^ 

E[^'-^i||v|] = -1 
Step 1(b). yi = 0,y2>0. 



kzk 
2;U 



l)+< 



2/z /2(A) + V AiV 



fezfc 



2^2 



E[,l-yi|b,d] = 2Ei:(^-i)f7 + 1M 



A:>2 



2// 



^2^ A2^(A) /log^iV 



^' /2(A) 



AiV 



(22) 
(23) 

(24) 
(25) 



E[y'2-y2\h,d] 

Hy2 - y2 1 |v|] 



■'-it^E|?(*-)tK^Et(^ 



i)|:+iM(26) 



log^iV 



2^2 j/2^ A2/o(A) yz A^ A2/o(A) 
/. /i2 /^(A) ^V/3(A) /2(A) + V AiV 



(27) 



E[z[ - zi I b, d] 
E[z[ - zi I |v|] 



Zl 



Ekzk 

fc>2 ^ 






Zl 



2/i 

£1 _ ziz X^fojX) 

/x 2^2 j2(A) ' 2^2 j2(A) 



+ 



2/x 
z' AVo(A) 



A;>2 ^ 



.2^2 



+ 



log2iV 

AAf 



(28) 
(29) 
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E[y'-y\h,d] 

ny'-y\W\] 



, fc>3 ^ k>2 ^ 



1 



3y3 



y A/2 (A) _ yz 

V /3(A) V /3(A) /2(A) 



2/i 
A3 A2/o(A) 



+ 






XN 



(30) 
(31) 



E[z'-z I b,d] 
E[z' - z I |v|] 



2;U 



E 



21E 

ifc>3 '" fc>2 ' fc>2 

.2 \4 



2/X 



^7:7(^-1)— I +111 



2[i 



2^1 



yA/2(A) zA/i(A) z^ AVo(A) ^ ^ ^log^iV 



fl /3(A) ;U /2(A) 2/.2 /2(A)2 



AiV 



(32) 
(33) 



E[//'-/x I b,d] 
E[/.' - fi I |v|] 



fcZfc 



2-2Eir(^-i)+lM 



-2 — 



2/i 
^ AVo(A) 



fc>2 
!i2. 



/^ /2(A) 



+ 



Step 1(c). yi=y2 = 0, Zi > 0. 

E[|/;-yi|b,d] = 0(^)- 

^[y'l - yi I |v|] 



°'^ 



log^iV 
AiV 



(34) 

(35) 
(36) 

(37) 
(38) 



E[y^-y2|b,d] 
E[2/2 - y2 I |v|] 



y^(fc-i)^ + 

^2 '^ 2;. 

^ ^;^ AVo(A) / log^JV 

8/i2 /3(A) /2(A) ^ V AiV 



(39) 
(40) 



K[z[-zi I b,d] 
E[z[ - zi I |v|] 



2/i ^ 2u ^ 

^ /t>2 ^ 



i)^ + y^(A: 

v2 \4. 



x2z2 



1 _ il _ fl£ ^ /oW , ^' AVo(A) /loK^iV 



2^ 4^2 j2(A) 4/.2 /2(A)2 



(41) 
(42) 



E[y'-y|b,d] 



kyk 



kzk 



2u ^ 2n 
fc>3 ^ fc>2 '^ 






(43) 
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E[y' - y I |v| 



y A/2(A) yz X^ AVo(A) ^ ^ ^log^iV 



2^. /3(A) 8^.2 /3(A) /2(A) 



XN 



(44) 



E{z' - z I b, d] 
E[z' - z I |v|] 






2n ~ ^ 2u ^-^ 2ii 

fc>3 '^ A:>2 '^ fc>2 ^ 

J^ A/2(A) _ z A/i(A) _ ^ AVo(A) / log^JV 

2/i /3(A) 2^ /2(A) 4^2 ^2(^)2 ^ ^^ 



(45) 
(46) 



kz-k 



E[^'-/.|b,d] = -l-^_fi(A:-l) + £[g 



fc>2 



E[^' - ^i I |v| 



Step 2. yi = y2 = Zi = 0. 



^ z A2/o(A) ^ ^/log^JV 



2;u /2(A) 



AiV 



E[yi-yi|b,d] 



'''^ 



IEbi-yi||v|] = O 



(47) 
(48) 



(49) 
(50) 



%^-y2|b,d] 

^^2 - y2 1 |v|] 



yz A3 A2/o(A) / log^JV 

8//2/3(A) /2(A) + V AiV 



(51) 
(52) 



E[z[ - zi\h, d] 
E[z[ - zi I |v|] 



^A^MAl /log^ 

4^2 /2(A)2 + V AiV 



(53) 
(54) 



E[y' - y I b, d] 
ny' - y I |v|] 



, X^kyk sr^ kzk . ,3y3 



-1- — 



fc>3 '■^ fc>2 

y A/2(A) yz A^ A2/o(A) 



2/i /3(A) 8/i2 /3(A) /2(A) 



+ 



log2iV 
AAf 



(55) 
(56) 



E[. -.|b,d] = i-L^-L^(^-i)^ + L^ + 

fc>2 '^ fc>2 '^ ^ fe>3 ^ 



5? 



(57) 
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nz' 



Z A/i(A) Z2 ^4^^(^) y ^j^^^) 



+ 



2^ /2(A) V /2(A)2 2/i /3(A) 



+ 



\N 



(58) 



E[/i' - /i I b, d] 

W-A\A\ 



-Elfc^ 



fc>2 



1) + 



^ z AVo(A) ^ ^/log^iV 



2/x /2(A) 



AiV 



(59) 
(60) 



Proof The verification of (J12p - (j59p is long but straightforward. We will verify (|12p and (|13p 

and add a few comments and hope that the reader is willing to accept or check the remainder by 
him/herself. 

Suppose without loss of generality that x is such that xi = v = 1 £ Yi. The remainder of x is a 
random permutation of 2m — 2/i *'s and 2fj, — l values from [n] where the number of times j occurs 
is dx(j) for j G [n]. The term -1 accounts for the deletion of v from F. There is a probability 
' that X2 G Yi and this accounts for the second term in (1120 . Observe next 



3/1 



2^-1 



f +0 



fczfc 



that there is a probability JH^ that 0:2 G -^fcj^ ^ 2. In which case another k — 1 edges will be 
deleted. In expectation, the number of vertices in Yi lost by the deletion of one such edge is ^^~ 
and this accounts for the third term. On the other hand, each such edge has a 



2y2 

2/X-3 



2^-3 

probability of 



being incident with a vertex in Y2 . The deletion of such an edge will create a vertex in Yi and this 
explains the fourth term. We collect the errors from replacing ^u by /i — 1 etc. into the last term. 
This gives a contribution of order 1/A^. The above analysis ignored the extra contributions due to 
multiple edges. We can bound this by 

m=y.'^y.^(''.-')(^)'"- (61) 



fc>3 



£>3 



1 



2^1 -k 



To explain this, we assume X2 G Z^, which is accounted for by the first sum over k. Now, to create 
a vertex in Yi, the removal of X2 must delete £ — 1 of the edges incident with some vertex y in 1^. 
The term „ ^, is the probability that the first of the chosen i — 1 edges is incident with y G 1^ and 

the factor ( ^ _^ J bounds the probability that the remaining i — 2 edges are incident with y. 

To go from conditioning on b,d to conditioning on |v| we need to use the expected values of y^, zi 
etc., conditional on v. For this we use (j4]) and ([5]). 

We have, up to an error term O 






E 


) ]kyk 

k>3 


V 


E 


y]kz, 

k>2 


V 



Y^ky 
fc>3 

k>2 



X' 



.>3 ^'/3(A) fsiX)^^jl ' hW 



E 



^k{k- l)yk 



k>3 



Y,Hk-l 



X'' _ zX ^AJ 
X'' yX^ 



A/i(A) 
/2(A) ■ 



fe>3 



^!/3(A) /3(A) ^^j! ^ /3(A) ■ 



(62) 
(63) 
(64) 
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E 



E 

fc>2 



k{k - l)zk 



^Hk-l) 



k>2 



fc!/2(A) " /2(A) ^^ j! 



A^' A2/o(A) , , 



In particular, using (|65|) in (|12p we get p3|) . The other terms are obtained in a similar fashion. We 
remark that we need to use ([5]) when we deal with products Zkl/i, k > 2 and ^ > 3. 

Since, k,i< logn in §B) we see, with the aid of ([62]) ~ ([65]) that E[r^ \ v] = 0{1/N). This bound 
is true for all other e?. D 



6.1 Negative drift for yi,y2,Zi 

Algorithm 2GREEDY tries to keep yi,y2,zi small by its selection in Step 1. We now verify that 
there is a negative drift in 

C = C{t) = yi + 2y2 + zi 

in all cases of Step 1. This will enable us to show that w.h.p. (" remains small throughout the 
execution of 2greedy. Let 



Then simple algebra gives 



E[C'-Cl|v|] 
IE[C' - C I |v|] 

nc - c I iv|] = - 

We will show 
Lemma 6.2. [Pittel] 

and 



n = n( ^ _ ?/^ A3 AVo(A) z' AVo(A) 

'^ "^^ > V /3(A) /2(A) + V /2(A)2 ■ 

^A2/o(A)- 



-(l-Q)-(C + y2 
-2(l-Q)-cf- + 



1 



+ 



2/i V/2(A) 
^A2/o(A) 



/" 2/.2/2(A) 



+ 



^' "^^ '^(2/. + V/2(A) '+'' 



+ - 

/ V 


og^N 
XN 


/log^N' 


\ 


V AiV 


J 


flog^N' 


\ 



XN 



Q 



A > implies Q < 1 

_JO(A-i), A ^00, 

~1l-e(A2), A^O. 



Proof 



Now, by ([TT]) . Q < 1 is equivalent to 



yz 



or, introducing x = y/z, 



AVo(A) 2 AVo(A) 

/2(A)/3(A)^^ /2(A) 



^ AMA)+^AA(A) 
2 ^^/3(A) +"/2(A) 



Fix,X):-- 



A5/o(A) A4/o(A) 

^/2(A)/3(A) "^"MAp- 



. A/2(A) A/i(A) 



< 1, VA>0,x>0. 



(66) 

Case 1(a) (67) 
Case 1(6) (68) 
Case 1(c) (69) 



(70) 
(71) 



(72) 
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In particular, i<'(oo,A) = 0. Now 



F,(.,A)=(.^ + ^j G(.,A), 



where 

^ ' ^ " /2(A)/3(A) V /3(A) + /2(A) 

_ 2 (r^M^ + ^Ai^^ ^M^ r, AVo(A) AVo(A) \ 

V /3(A) + /2(A) y /3(A) r/2(A)/3(A)+ /2(A)2;- ^'^^ 

Notice that 

G(0,A) = AVo(A)/i(A)/2(A)-3/3(A)-i(A/i(A) -2/2(A)) > 0, 

as A/i(A) — 2/2 (A) > 0. Whence Fx{0, A) > and as a function of x, F{x, A) attains its maximum 
at the root of G{x, A) = 0, which is 

_ _ /3(A)(A/i(A)-2/2(A)) 

A/2(A)2 ■ ^'^^ 



Now, ([73)1 impHes that x satisfies 



- A5/o(A) AVo(A) ^ A5/o(A) /_ A^A) A/i(A) \ /3(A) 
/2(A)/3(A) ^ /2(A)2 /2(A)/3(A) V /3(A) ^ /2(A) ; 2A/2(A) ^ ' 



and (jTlj) imphes that 



_ A/2(A) A/i(A) _ 2(A/i(A)-/2(A)) 
/3(A) + /2(A) /2(A) • ^'^^ 



Substituting ([TSD, ([76]) into ([72]), we see that 

/2(A)/3(A)2A/2(A) _ A /o(A) 



F(:c,A) 



x 



A/^ + A^^) 4/2(A)(A/i(A) - /2(A)) 



Thus, 



where 



^-^^^'^^^ 4/2(A)(A/';|Ai-/2(A)) ' ^^^) 



D{X) =4/2(A)(A/i(A) - /2(A)) - AVo(A) 

= _ 4 - 4A - (A^ + 4A2 - 8)e^ + (4A - 4)6^^. 



In particular, 



l-F(x,A) = 1-0(A^^), A ^00. (78) 

Expanding e^and e^^, we obtain after collecting like terms that 

i>6 ■^' 
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where 

d, =2^+i(j-2)-(j)4-4(j)2 + 8. 

Here dj = for < j < 5 and 4 = 40,^7 = 280,4 = 1176,4 = 3864, 4o = 10992 and 
dj > for j > 11 is clear. Therefore D{X) is positive for aU A > 0. Since D(X) ~ 4-^^ ^-iid 
4/2(A)(A/i(A) - /2(A) ~ A^ as A ^ 0, we see that 

1 - F(x, A) ~ d6A^ A^O. (79) 

This completes the proof of Lemma 16.21 D 

It follows from (|67l) . (i68]) . (f69|) and Lemma [62] that, regardless of case, 

C > implies E[C' - C I |v|] < -ci(l A Xf + O O^j^) (80) 

for some absolute constant ci > 0, where 1 A A = min{l, A}. 

To avoid dealing with the error term in (|80p we introduce the stopping time, 

• j\ \2 ^ log^?^" 
mm < r : A < 



AiV 

(This is well defined, since eventually A^ = 0). 
The following stopping time is also used: 

To = min{t : A < 1 or iV < n/2} < Ter- 

So we can replace (i80]l by 

C > implies E[C' - C I |v|] < -ci/2, < t < Tq, (81) 

which holds for n sufficiently large. 

There are several places where we need a bound on A: 

Lemma 6.3. Whp A < 3ce for t < Tq. 

Proof We will show that w.h.p. yi + 2y2 + zi = o{n) throughout. It follows from (|lip and the 

inequalities in Section [8.0.21 that if A is sufficiently large and if X{t) > A then Y U Z contains y + z 
vertices and at least A(y + z)/2 edges and hence has total degree at least A{y + z). We argue that 
w.h.p. G does not contain such a sub-graph. We will work in the random sequence model. We can 
assume that |y U Zj > n/2. Now fix a set S C [n] where s = IS"! > n/3. Let D denote the total 
degree of vertices in S. Then 

"iH hds=di=l diH |-ds=a 

dj>3 dj>0 

\d„d 

dlfsW 
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Here A = A(0) and we are using Lemma [3. 11 The factor 0{n^''^) accounts for the conditioning that 
the total degree is 2cn. Now A(0) < 2c and /3(A(0)) > 1. It follows that 

P05:.>A.)<o(„'/.) E E (';)(^<o(„'/^) E E i^y'^ 

s>n/3d>As ^ ^ s>n/3d>As 

The terms involving d in the second sum are u^ = ^| and for d/s large we have Ud+i/ud = 0{s/d) 
and so we can put d = As in the second expression. After substituting d! > (d/e)'^ this gives 

F{3S:d>As)<0{n'/') J^ (^^^)' = o(l) 

s>n/2 ^ ^ 

if A > See. D 

Our aim now is to give a high probability bound on the maximum value that (^ will take during the 
process. We first prove a simple lemma involving the functions 4>j{x) = ^ r,^^ , j = 2,3. 

Lemma 6.4. 

1 

4>j{x) is convex and increasing and j < 4>j{x) and < (t)j{x) < 1 for j = 2,3. (83) 

J + 1 

Proof Now, if H{x) = ^qS then 

, _ G{x){xF'{x) + F{x)) - xF{x)G'{x) 

and 

H"{x) = 

2xF{x)G'{xf + G{xf{2F'{x) + xF"{x)) - G{x){2xF'{x)G'{x) + F{x){2G'{x) + xG"{x))) 

G{xf ■ 

Case j = 2: 

,, , , e^^ - (x^ + 2)e^ + 1 , ,, 



e^^ - (x^ + 2)e^ + 1 = 2^ ^ — '- 

i>4 

and so (p^ix) > for x > 0. 



^„. , e2^(x2-4x + 2) + e^(a;3 + a::2 + 4a::-4) + 2 , , 

0213;) = r^3 . (85) 



But 

.2x/ 2 ^„ , o\ , „x/ 3 , 2 



e^^(a;^ - 4x + 2) + e^(a;'^ + x^ + 4x - 4) + 2 



y^ 2^-HJiJ - 1) - 8J + 8) + J(i - 1)(J - 2) + i(j - 1) + 4j - 4 ^, 



j>6 -^ 
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and so (j)2{x) > for x > 0. 

Case j = 3: 

, , , le^"" - e^ (x^ - x^ + 4x + 4) + x^ + 4x + 2 

'^3(^) = V^-^ k-2 • ^^^) 

2(e^ — 1 — X — ^)"= 

But, 

2e^--e-(x3-x^ + 4x + 4)+x^ + 4x + 2 = ^^^"''-^'^^'-^^^^'-^;+^'^^'-^)-^^'-^x^- 

j>6 -^^ 

and so (/'3(x) > for x > 0. 

,„, , x(e2^(2x2 - 12x + 12) + e^(x^ + Sx^ - 24) + 2x2 + 12x + 12) 

4(e^ - 1 - X - ^)3 

But 



e2'^(2x2 - 12x + 12) + e'^(x'' + Sx^ - 24) + Ix"^ + 12x + 12 

^ 2^-HjU - 1) - 12j + 24) + Jij - l)(j - 2)(i - 3) + 8i(j - 1) - 24 ^ 

and so 4>'^{x) > for x > 0. 

So (j)2,4'3 are convex and so we only need to check that i?i>2(0) = 2,02(0) = 1/3, i?i>3(0) = 3,(p'^{0) = 
1/4 and<?;>'2(oo) = 0^(oo) = 1. D 

Consider A as a function of v, defined by 

#3(A) + #2(A) = n 

where IT = 2/i — yi — 2^2 — zi . 

We now prove a lemma bounding the change in A as we change v. 



Lemma 6.5. 



|A(vi) - A(V2)| = O { ''""^ J'^^' ) , fort< Ter. 



Proof We write vi = {yi,y2,zi,y,z,n) > and V2 = {yi + 5y^,y2 + Sy^,zi + 6z^,y + 6y,z + 

5z-,n + dfj,) > and 11, 11 + Jn for the two values of 11. Then 

{y + Sy)M^ + ^a) - #3(A) + {z + 6z)M^ + <^a) - hW = ^n- 
Convexity and our lower bound on (p'- implies that 

<Ai(A) > </>i(A + 6x) - Sx^'jiX + Sx) > <pj{X + 6x) - 6x. 
So from (|89p we have 

{y + Sy){MX) + Sx) - yMX) + {z + <5.)(02(A) + 6x) - MX) > -^n. 
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This implies that 



5u -6y(t>3{X) -(5202(A) 



y + 5y + z + 6z 
So, 

/ 1 |vx — V2I |l 

(^A < imphes \6x\ = O 



N 

Note that we use Lemma [6.3l to argue that (pj{X),j = 2, 3 are bounded within our range of interest. 
To deal with 5a ^ we observe that convexity implies 

</'j(A + 5A)></'i(A) + <5A0;(A). 

So from (j89p we have 

{y + 6y){M^) + Sx(t)'3{X)) - yhW + {z + <5.)(02(A) + 5a</'2(A)) - M^) < Su- 

This implies that 

5u- Sy(l)3{X) - (5202(A) 



{y + 6y)<p'^{X) + iz + 5,W,iX)' 
So, 

/ llvi — V2I |l 
5x>0 implies \5x\ = O ( — 



D 



Lemma 6.6. //c > 15 then q.s. 

/SI < t <ro : C{t) >log^n. 

Proof Define a sequence 

^ ^ fmin{C(i + 1) - C{i), log n} < i < Tq 
\-ci/2 To<i<n 

The variables Xi,X2, ■ ■ ■ ,Xn are not independent. On the other hand, conditional on an event 
that occurs q.s., we see that 

Xs+i + ... + Xt = C{t) - C{s) for < s < i < To 

and 

E[Xt I Xi, . . .,Xt-i] < -ci/2 for t < n. 

Next, for < s < t < To let 

t 

X{s,t)= Y. Xirf. 

T = S + 1 

Note that 

X{s,t)>t-s. (90) 
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We argue as in the proof of the Azuma-HoefFding inequahty that for any 1 < s < t < n and u > 

{t — s) log^ n 
We deduce from this that 



(X,+i + --- + Xt>u- ciA(s, t)/2) < exp <j ^^ ^ } . (91) 



P(31 < s < f < To : C(s) = < C{t),s < t < t) < 

n^ exp { - ^-^,-^.--y-,^,,- — -"0--J ■ ^g2) 



2 I 2max {0,ciA(s, t)/2 — logn}" 

(i — s) log n 



Putting t — s = Li = log n we see from ([92]) that q.s. 

^1 < s < t - Li < To - Li : C(s) = < C{t),s <t <t. (93) 

Suppose now that there exists r < Tq such that ("(r) > Li. Then q.s. there exists ti <t <ti+ Li 
such that C,{ti) = 0. But then given ti, 

. w N . N f 2(ciLi/2-logn)2 
P(3ti < T < ti + Li : C(t) > Li) < exp <{ I ,^ 2 



Li log n 

Here we are using the generalisation of Hoeffding-Azuma that deals with maxj<i^ Xi + • • • + Xj. 

And then we get that q.s. 

^t < To : C{r) > Li. (94) 

We do this in two stages because of the condition C > in (j8ip . Remember here that C(0) = and 
([93l) says that ( cannot stay positive for very long. D 



7 Associated Equations. 

The expected changes conditional on v lead us to consider the following collection of differential 
equations: Note that we do not use any scaling. We will put hats on variables i.e. yi etc. will be 
the deterministic counterpart of yi. Also, as expected, the hatted equivalent of (j88|) holds: 

H , =2fi-yi -27/2 - zi. (95) 



/3(A) /2(A) 



Step 1(a). yi > 0. 



^ ^ _i_yi_ yi^ aVo(a) y2£ aVo(a ) 

dt 2fi Ajl^ /2(A) 2/i2 /2(A) 



l_i^_^^L2£V^ + i^^L22v:iZ, (96) 



dm _ m mz aVo(A) ^ yz A^ aVo(A) 

dt A 2/i2 J2(A) 8/i2/3(X) /2(A) ' 

C^^l _ Zi ZiZ A2/o(A) ^2 A^/q(A) 

dt 2/i 4/i2 /2(A) 4/i2 /2(A)2 ' ^ ^ 
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dy _ y A/2(A) yz )? >? fo{\) 
dt ~ 



2fi /3(A) 
y A/2(A) 



dz 

'dt~2fi /3(A) 2/i /2(A) 

dfi _ ^ z AVo(A) 
dt 2/i /2(A) 



8A' /3(A) /2(A) 

A/i(A) z^ AVo(A) 
4/i2 



/2(A)2 



Step 1(b). yi = 0,y2>0. 



dt 

dm 

dt 

dzi 
~dt 



dt 

dz 
'dt 



A' /2(A) ' 
^ 2^2 ^2i AVo(A) ^ p A^ AVo(A) 

/i /i2 /2(A) 4^2 /3(A) /2(A) 
zi ziz AVo(A) z^ AVo(A) 
A 2/i2 /2(A) 2/i2 /2(A)2 ' 

y A/2(A) yz X' X^foCX) 

i^ /3(A) V j3(A) /2(A) ' 
y A/2(A) I A/i(A) z^ AVo(A) 
A /3(A) A /2(A) 2/i2 ;2(A)2 ' 
z A2/o(A) 



^ = -2- -- 

dt A /2(A) 



(99) 
(100) 

(101) 

(102) 
(103) 
(104) 
(105) 
(106) 
(107) 



Step 1(c). yi =y2 = 0,zi > 0. 



dm 

d^2 ^ y^ A3 AVo(A) 
dt 8/i2 /3(A) /2(A) ' 

1^ AVo(A) 
/2(A) 
yz X^ 



dzi __ _ il _ £]_ 
dt ~ 2/i 4/^2 



# ^ J/ A/2 (A) 

c«i 2/i /3(A) 

d£ ^ ^ A/2 (A) _ 
^* " 2/i /3(A) " 

^ = -1 
dt 



8A' /3(A) 
^ A/i(A) 
2A /2(A) 
^ A2/o(A) 



^^ AVo(A) 

4A2 /2(A)2 ' 
AVo(A) 

/2(A) ' 

z' AVo(A) 
4A2 /2(A)2 ' 



2A /2(A) 



Step 2. yi = y2 = ^1 = 0. 



*-■ 



dt 



P A3 AVo(A) 
8AV3(A) /2(A) ^ 



(108) 
(109) 

(110) 

(111) 

(112) 

(113) 

(114) 
(115) 



25 



dzi 
~dt 

dy 
dt 

dz 
'di 



z' AVo(A) 

_ 1 _ i_ ,^^2 (A) 



y^ A3 A2/o(A) 



2/i /3(A) 8^2 /3(A) 
z A/i(A) 52 ^4^^(^) 









2/i /2(A) 4/i2 

^ A2/o(A) 

2A /2(A) ■ 



/2(A) 
y A/2(A) 
/2(A)2 ■ 2A /3(A) ^ 



+ 



(116) 
(117) 
(118) 
(119) 



We will show that w.h.p. the process defined by 2GREEDY can be closely modeled by a suitable 
weighted sum of the above four sets of equations. Let these weights be 9a, Ob, 6c and 1 — 6a — Ob — 6c 
respectively. It has been determined that yi,y2,zi are all 0(log n) w.h.p.. We will only need to 
analyse our process up till the time y = and we will show that at this time, z = n[n) w.h.p.. 
Thus yi,y2,zi are "negligible" throughout. In which case yi,y2,Z2 should also be negligible. It 
makes sense therefore to choose 6a = 0- The remaining weights should be chosen so that the 
weighted derivatives of yi,y2, z\ are zero. This has all been somewhat heuristic and its validity will 
be verified in Section 17.21 



7.1 Sliding trajectory 

Conjecturally we need to mix Steps 1(a), 1(b) 1(c) and 2 with nonnegative weights 6a = 0, 6b, 6c, 
O2 = 1 — Ob — Oc respectively, chosen such that the resulting system of differential equations admits 
a solution such that 2/2 (i) = and zi{t) = 0. 

We will write the multipliers in terms of 



A 



y^A5/o(A) 



^_ z^X^fo{X) ^ ^_yXf2{X) 



8AV2(A)/3(A) 4/i2/2(A)2 2fih{X) 

Using, dM]), (fT02]> . (fToHl) and (fTTil) we see yi{t) = implies that 

dyi . 



D 



zx^kjx) 

2/i/2(A) ' 



(120) 



= 



dt 



Equivalently 



0a = 0. 

Using dlT]), ([109]) and ([TTS]) . we see that y2{t) = implies that 

dm 



:i21) 







dt 



-1 + 



yz A3 A2/o(A) 
4/i' /3(A) /2(A) 
= -{l-A)Ob + A. 

Equivalently 



^^J^A^MA)^ 
8AV3(A) /2(A) 



A 
I- A 



yz A3 A2/o(A) 
8A' /3(A) /2(A) ' 



(122) 
(123) 



26 



Likewise, using ([MD, (flTO]) and (fTT6]) . 2;i(t) = implies 
dzi 







^0fe -e^ + B. 



+ 



-1 + 



i^ AVo(A) 
4/i' /2(A)2 



+ (1 - ^fe 



Equivalently 



^e = (1 + ^6)^ 



B 



I- A 



z^ AVo(A) 




4A' /2(A)2 ' 






(124) 




(125) 



From (fT23]l it follows that e*;, > iff 

i< 1, 

in which case, by (fT25]) . 61c > 0, as well. From (fTM]) and p^ it follows that 1 - 6^;, - 61c > iff 

2i + i3<l. (126) 

We conclude that 9b, Oc, I — 6b — 6c ^ [0, 1] iff Q < 1, see ([66]) . But this is implied by Lemma [6T2l 
It may be of some use to picture the equations defining da, Ob, 6c, 02'- 



(127) 



If in the notation of Lemma 16.61 we let r^i = {v : (^ < Li} then we may restrict our attention to v 
in p2|) - (|59|) such that v G fii. In which case, the terms involving yi,y2, z can be absorbed into 
the error term fopr t <Tq. The relevant equations then become, with 



6a 

il- 


-A)6b 


= 

= A 




—B6b +6c 


= B 


ea 


+6b +6c +62 


= 1. 



A 



Step 1(a). yi > 



y^A5/o(A) 



8^2/2(A)/3(A)' 

0. 


Wi - 


-yi 1 


%2- 


-2/2 1 


nA- 


- Zl 


ny 


-y 1 


E[z' 


- z 


W 


-M 1 



^ _ z^AVo(A) ^_ yXf2{X) ^_zA2/o(A) 



V/2(A)2^ 

= -1 + 
= A + O 
= B + 



2/i/3(A) 



2/i/2(A) 



log^iV 
AiV 
log^iV 

AiV 
log^iV 

AAf 

C-A + O 



log^iV 
AAf 



C-(l-C)-S + 



log^iV 
AiV 



1-D + O 



log^TV 
AA^ 



(128) 
(129) 
(130) 
(131) 
(132) 
(133) 
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step 1(b). yi = 0,y2>0. 

ny'i - yi 

IE[y2 - 2/2 

E[z[ - zi 

E[y' - y 

E[z' - z 

E[/x — /i I |v 

Step 1(c). yi = 2/2 = 0, zi > 0. 

JE[yi - yi I |v 
lE[y2 - V2 

E[z; - zx 
E[y' - y 
E[z' - z 
E[/i' - IX 

Step 2. yi = 7/2 = 2;i = 0. 

IE[yi - yi I |v 

Eb2 - ?/2 

E[z']^ — zi 
E[y' - y 
E[z' - z 
E[;u' - /i 



O 



log^iV 
\N 



-1 + 2A + 



log^iV 



25 + 



log^iV 
\N 



-2C -2A + 



log^iV 



2C -2{l-C)-2B + 
-2-2D + 



XN 



WiV 



XN 



-'^ 



A + O 



log^iV 
XN 



-1 + B + O 
-C-A + O 



log^N 
XN 

log^N 
XN 

C-{l-C)-B + 

'log^N 



log^N 
XN 



-l-D + 



XN 



-'^ 



A + 



B + 



log^N 

XN 
loR^N 



XN 
-1-C-A + O 



log^N 



XN 

l + C-{l-C)-B + 
'log^iV 



log^iV 
XN 



-1-D + O 



XN 



(134) 
(135) 
(136) 
(137) 
(138) 
(139) 

(140) 
(141) 
(142) 
(143) 
(144) 
(145) 



(146) 

(147) 
(148) 
(149) 
(150) 
(151) 
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7.2 Closeness of the process and the differential equations 

We already know that yi,y2,zi are small w.h.p. up to time Tq. We now show that w.h.p. y,z,ii 
are closely approximated by y, z, fi, which are the solutions to the weighted sum of the sets of 
equations labelled Step 1(b), Step 1(c) and Step 2. These equations will be simplified by putting 
yi = y2 = zi = 0. First some notation. We will use "05^,, to denote the expression we have obtained 
for the derivative of ^ in Case 1 (rj) or Case 2 in the case of i] = 2. We are then led to consider the 
equations: 

Sliding Trajectory: 

dv 

-17 = Gb4'b,y{y, z, fi) + 9ciJc,y{y, z, fi) + {l-eb- 9c)tp2,y{y, Z, fl) 

= eb{-2{C + A)) + 9,{-{C + A)) + {l-9b- 9,){-{l + C + A)) 
= -{C + A){29, + e, + l-9,-9,)-{l-9,- 9,) 
B-C 



I- A 



1. 



dz 

-jr = 9bi'b,z{y^ ^' A) + ^c^cAv^ ^' ji) + {i-9b- 9c)^p2.ziy: z, jj) 

dt 

= 9b{2{C - (1 - (7) - B)) + 9c{C -{l-C)-B) + {l-9b-9c){l + C-l + C-B) 

= {2C-B)i9b + l)-29b-9, 

_2C -2A-2B 

~ I- A ■ 
-1- = 9bipbAy^ ^' A) + Ocipc,^i{y, z, il) + {l-9b- 9c)ip2,^i{y, z, fl) 
= 9b{-2{l + D)) + ^,(-(1 + D)) + {l-9b- 9,){-{l + D)) 
= -{l + D){29b + 9, + l-9b-9,) 

1 + D 
1-A' 

The starting conditions are 

y(0) = n, z(0) = 0, /i(0) = en. (152) 

Summarising: 

dy B-C dz 2C - 2A - 2B dfL l + t) , , 

— - = 1: — - = ; -— = -. (153) 

dt I- A dt I- A dt I- A 

and 

»^ + MW=2A. (154) 

/3(A) A(A) 

We remark for future reference that ()153p implies that 

jl is decreasing with t as long as A > (155) 

and (1154p implies that 

y + z<'^. (156) 

A 
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Let u = u(t) denote {y{t),z{t), fi{t)) and let u = u(t) denote (y(t), z(t),/i(t)). We now show that 
u and u remain close: 

Lemma 7.1. 

||u(t) - u(t)||i < n^/^ /or 1 < t < To, w.h.p.. 

Proof Let 5^(v), 7/ = a, b, c, 2 be the 0/1 indicator for the process 2greedy applying Step 1(?/) 

for T] = a,b,c or Step 2 li rj = 2 when the current state is v. For times ti < t2 we use the notation 

t2 



t=tl 
Now let p = n" where a = 1/4. It follows from Lemma 16.51 that for t < Tq — p, 

\X{t)-X{t + p)\<j^^^^. (157) 

Because A changes very little, simple estimates then give 
Claim 2.2. 

//||u(t)-u(t)||i <n8/9 i/ien 

|A(0-i(.)l = o(Ma_jp!l) |B(,)_B(,)| = o(MIg<*) (160) 

|cw-c(*)i=o(M)__|Mk) |z>(,,_o(,,| = o(l!^iM)!!i) dei) 

Proof The first expressions in (jl58p and (jl59p are easy to deal with as the functions fj are 

smooth and A is bounded throughout, see Lemma l6.3i Thus the each fj changes by 0(plogn/A^) 
and y,z,p change by O(plogn) and p = il(iV). 

For (I160p and (I16ip we use Lemma 16.51 to argue that 

Our assumption t < Tq implies that p{t) = Q{n) and then p{t) ~ p{t) and N{t) ~ N{t) and we 
can argue as for ()158p and ()159p . 
End of proof of Claim [272] 

Now fix t and define for ^ = yi,y2,zi, 

^, , fe(t + « + i)-e(i + ^) t + i<To 
\E[e(i + i)-e(i)|v(t)] i + i>ro 
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Then, 

logn > E[X,{0 I v(t + i)]= Yl ^(i + i)^vA^it + i)) + (^^^^^7^) (162) 

It follows from l\157\i - (jl59p that for all r], ^ and i < p, 

(loE n \ 

It then follows from ()162p that q.s. 

log n > Eiat + p)- m I uW] = E ^^("(*' * + /^))V'.,s(u(i)) + O r J^] 

This can be written as follows: We let A^ = Aa(u(t, t + p))/p etc. and A = A{t), B = B{t). 

-Aa =Oip-Hogn) 

il-A)Ab =A + 0{p~Hogn) 

-BAb +Ac =B + 0{p-Hogn) 

Aa +Ah +Ac +A2 = 1. 



(163) 



(164) 



In comparison with (|127p we see, using (jl60|) . (J16ip that 

\p9^{u{t)) - Ad = O (^logn+ ^""^^^~"^^^"^ ) for ^ = a,b,c,2. (165) 

Note that A,A < 1/2, see ()126p . This will be useful in dealing with 6i, and A^. 
We now consider the difference between u and u at times p, 2p, .... We write 

ip 

i{ip) - i{ip) = e((i - i)p) - i{{i - i)p) + Y. ([^(*) - ^(* - 1)] - [^^(*) - ^(* - 1)]) (166) 

t=(i-i)p+i 

where ^ = y,z, p and (, = y,z, p, in turn. Then we write 

^(t) - e(t - 1) = Oi + A and e(t) - e(i - 1) = at + A (167) 

where 

«t= I^ 6rj,^{vi{t - l))iP^^^{u{t - 1)) and ^t = ^{t) - ^{t - 1) - at 

rie{a,b,c,2} 

and 

"*= Z] ^^,5-(u(t-l))V'^_j-(u(t-l))andA = |(t)-e(t-l)-ai. 

»;6{a,6,c,2} 

It follows from (13T\) . (fT32]) etc. that 

E[A|u(t-l)]^0^^°^^^(^)^ 



\mm)j 
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An easy bound, which is a consequence of the Azunia-Hoeffding inequahty, is that 



ip 



Y, A > P'/' log' n I < e-^(i°g ") . (168) 

.t=(j-i)p+i 



We see furthermore that 

ip 



t={i-l)p+l 

= f; je'(u(t-i + ,,))- ^ ^^_^-(u(t-i))|;(u(t-i)) 

i=(i-l)p+l \ r]&{a,b,c,2} ^ 

= f; |e'(u(t-i)) + o('i^]- 5] 0^^^-(u(t-i))|;(u(t-i)) 

= o(^)=o(pV.log^n), (169) 

where < q < 1 and ^l(t) is the derivative of ^ in Case rj. 

In this and the following claims we take A^ = N{ip), the number of vertices at time ip. 

Now write 

p log n 



^p ■•^ / 

IZ "*= E E '^'7,«(uW)(V'^,?M(i-i)/o) + o, ^ 

t=(j-l)p+l i=(i-l)p+lr?e{a,6,c,2} ^ ^ 

= Yl Mu{{^-l)p + l,^p))^,,^{u{{i-l)p) + o(!^^^ (170) 

T;e{a,b,c,2} ^ ^ 

and 

ip 

t=(i-i)p+i 

ip 

t=(j-l)p+l??e{a,6,c,2} 

= E pOv,d^{{i-l)p))i^r,,d^{{i-l)p) + o(^Y (171) 
It follows that 

ip 

E {at-at) = Ai+A2 + o(^p^/^log^n^ (172) 

i=(i-l)p+l 

where 

Ai = Y (^?("((^ - 1)P + 1' ^Z')) - P^'y-cCuCl^ - l)p)))^r,,e(u((i - l)p) 

r)e{a,fe,c,2} 
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M=P ^ V'»),«(u((i- l)p)('0,,,5(u((i- l)/))-^^,5(u((i-l)p)) 

»?e{a,6,c,2} 

It follows from (jl66p to (jl74p that w.h.p., ip < Tq implies that with 

ai = \\\i{ip) - \i{ip)\\i (175) 



that for some Ci > 0, 

where iVj = N{ip) > n/2. 
Putting 

we see by induction that 



<a.-i{l + ^)+2p'/'log' 



n 



n-n(-t) 

1=0 ^ -^ ^ 



< g2CiJp/n 



j=o ^ 

Since i < nj p we have 

||u(zp) — u(ip)||i = 0(np~ ' log n). 

Going from p[ro//9j to Tq adds at most plogn to the gap and the lemma follows. D 

8 Approximate equations 

The equations ()153p are rather complicated and we have not made much progress in solving them. 
Nevertheless, we can obtain information about them from a simpler set of equations that closely 
approximate them when c is sufficiently large. The important observation is that when A is large, 

i«l; 5«1; (7^1^; ^ « ^; A « ^. (177) 

We will therefore approximate equations (|153p by the following equations in variables y, 5, /i, A: 

y' = -^ - 1 (178) 

y + z 

z' = -^- (179) 

y + z 

~^' = -l- ^!^ (180) 

(2/ + -2)2 
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A = ^^. (181) 

y + z 

The initial conditions for y, z, /x, A are that they start out equal to y, z, /i, A at time t = i.e. 

y(0) = n; 5(0) = 0; /i(0) = en; A = 2c. (182) 

8.0.1 Analysis of the approximate equations 

The first two approximate equations imply {y + z/2)' = —1, so that 

z 
y + - = n-t. 

Using the second approximate equation and y = 1 — t — z/2, we obtain 

,, _ 2(?i - t - z/2) 
~ n-t + z/2 ' 

or, introducing t = n — t and 

X 



2(n-t) 2r' 
we get 



X2 + 1 ^ r 

Integrating, 

- ln(X^ + 1) + arctan X = - In r + C. 

Now, at t = we have t = n and X = 0. So C = Inn, i. e. 

- ln(X^ + 1) + arctan X = - ln(T/n). 

Let T satisfy y{T) = 0. At t = T, we have A = 1, so 

1 vr 

In n — ln(n — T) = — In 2 H 

^ ' 2 4 



^^^ (iA = --dr. (183) 



which implies 



Note that 



f = ( 1 - ^y^e"^/^ ) n ?« 0.677603n. (184) 



~,_ 2/2' 2^(y' + z') 



y + z {y + zY 

2 Azjl 2jl ( y 



y + z {y + zY {y + zf \y + z 

2 2zfl 

y + z (y + z)3 

_2 zX 

y + z [y + zY' 
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which imphes that A is decreasing with t, at least as long as y,z,X > 0. 



(185) 



Here 



(y + z)2 (n-i + 5/2)2 

z 



Likewise 



So A satisfies 



(n-t)2(l + X)2 
2X 
"(n-t)(l + X)2- 

2 2 



A' 



y + z {n-t){l + X)' 

2 2X 



(n-t)(l + X) {n-t)il + Xy 



A, A(0) = 2c. 



Using (|183|) . we obtain 



dA 



2X 



dX 1 + X2 (1 + X)(1 + X2) 

Integrating this first-order, linear ODE, we obtain 



A, A(X) 



x=o 



2c. 



A(X) 



^/r+x^ 



2c 



X 9 arctan : 



(i + x)^/rT 



: (ix 



In which case 



A(r)« 1.53c- 1.418. 



8.0.2 Simple Inequalities 

We will use the following to quantify (I177p : 

i<m=i+e,. i<m=i+e., i<m=i^es. 



/3(A) 



/2(A) 



where 



A2 1 + A 

^1 — — r~T7' ^2 — -— T7> ^3 



/3(A) 

A2 + 2A + 2 



2/3(A) /2(A) 2/3(A) 

We use the above to verify the following sequence of inequalities for y, z, fi, A: 

2A / N ? 2u 

(1 - £4) < A < ^ 



y+z y+z 

< i < £5 

< -B < £6 



(186) 

(187) 



(188) 
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where 

1 + ^1 8/o(A) /o(A) 

(We use ()156p to get yz < /i^/A^ for use in defining £5). 

For (fT88]) we use 

A(y + i)> 1 _/3(A)_ 1 



2A "max|f§,f0| /2(A) l + ^r 

L/3(A) /2(A) J 

It follows from (|188p that the initial value Aq of A satisfies 

2c> Ao > 2c(l-£4)- 
Now £4 < .0001 for X > 15 and so 

2c(l - .0001) < Ao < 2c. (189) 

8.0.3 Main Goal 

Lemma 18.11 (below) in conjunction with Lemma 17. H will enable us to argue that w.h.p. in the 
process 2greedy, at some time T <Tq we will have 

y{T) = 0, z{T) = n{n) and A(t) = 0(1) for t < T. (190) 

Define 

r+ = min{i > : y{t) < or z{t) < or y{t) < or z{t) < 0} . 

We can bound this from below by a small constant as follows: Initially A, B are small C is close to 
one for c > 15 and so ()153p implies that z is strictly increasing at the beginning. Also, y, y start 
out large (= n) and so remain positive initially. 

Next define 

Ti =min|r+,max|t : A(t)) > A* and min{2/(r) + z(T),y(r) + z(r)} > /3n for r < tjj (191) 

where 

/? = -.01 + i(r)/n = -.01 + 2(n - f)/n ^ .63 

and 

A* = A(f ) - 5. 

Comparing ^T87\ and (fT89]l we see that Aq > A*. 

Note that 

Ti < t. (192) 

This is because y{T) = and y'{T) = — 1. 
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Lemma 8.1. For large enough c, 

y{Ti) = < z{Ti) = (7(n) and A(Ti) = 9.{l). (193) 

Proof It follows from (fTCHj) and Section [glT^ that 
y < - 1 < -^-— r -l + ej 

1-^5 y + ^ 

y'>-%^-l>-4.-l-es. (194) 

1-65 y + z 

., 2yA 1 + gi , 2j/ 

2/u 1-^5 y + ^ 

2{/ 
z' > — 267 (195) 

y + z 

X < -^^ (196) 

y + z 

X>^{l-e,)>^-e, (197) 

y+z y+z 

I zX? 2zn , ,9 2z/i 

A < -1 - ipr < -1 - T^r-^ 1 - Sif < -1 - % + £10 

2/i (y + z)^ [y + z)^ 

1 + ^(1+62) 2zA , , 

A' > f > -1 - -rr-^ - en- 198 

1-65 (y + z)^ 



where 



£4 + £5 + £6 £l + £5 

1 - £5 1 - £5 



£7 — , £8 — , £9 — ^£4, 



2A£4 A(£2+£5)+£5 

l-£4 (l-£4)(l-£5) 

When i = we have y = n, z = 0, jl = en and A satisfying (jl89p . we see that Ti > for c > 15. 
We can write y(0) = n, z(0) = 0, /i(0) = en and 

y' = -^-^ -1 + 01 where \9i\ < 6*. (199) 



y + z 
2y_ 
y + z 



z' = -^^ + 02 where 1^21 < 25*. (200) 



2zfx 
i^T~z) 
2/i 



/^' = -1 - ,„. , .^2 + ^3 where - £11 < ^3 < £10 (201) 



y + z 



+ 6*4 where - £9 < 6*4 < 0. (202) 
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where 

6* = max{ei,e2, • • • ,£3} • 

It can easily be checked that the functions ei,...,eii are all monotone decreasing for A > A*, 
(A* (15) ~ 16.549). Furthermore, (5* (16) < .00011 and our error estimates will mostly be 6* times 
a moderate size constant. The only exceptions contain a factor c, but if c is large then 5* will 
decrease to compensate. 

It follows from (fTSOl) and (|20TI) that 

fi, p, both decrease for t <Ti, since ^3 < 1 for A > A*. (203) 

The ensuing calculations involve many constants and the expressions (j214p and (j216p claim some 
inequalities that are tedious to justify. It is unrealistic to expect the reader to check these calcula- 
tions. Instead, we have provided mathematica output in an appendix that will be seen to justify 
our claims. 

The reader will notice the similarity between these equations and the approximation (|178|) - p8ip . 
We will now refer to the equations (|199p - (|202p as the true equations and (|178p - (jlSip as the 
approximate equations. 

8.0.4 y, z and y, z are close 
We claim next that 

max{|y(t) - y(t)|, \z{t) - 5(t)|} < (^*Fi(t/n)n for < t < Ti. (204) 

where 

FJx) = /3(e2'^^-/^ - 1) for x < -. (205) 

n 

for a > 0. 

Note that 

F^{t) = 2{aFa{t)/l3 + l). 

In the proof of (|204p . think of n as fixed and /i as a parameter that tends to zero. Think of e as 
small, but fixed until the end of the proof. In the display beginning with equation (|207p . only h is 
the quantity going to zero. Let 

Ui = y{ih),Vi = z{ih), Ui = y{ih),Vi = z{ih) for < i < n/h. 

Assume inductively that for i < iq = Ti/h 

\ui — Ui\, \vi — Vi\ < 5FiJ^ir{ih/n)n. (206) 



This is true for i = 0. 
Suppose that 



Fi+,((i + l)h/n) = Fi+Sh/n) + -F'{{i + d)h/n) 



n '^'' 
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for some < 9 < 1. 

Then by the inductive assumption and the Taylor expansion and uniform boundedness of second 
derivatives, 



Vi+i - Vi+i = Vi-Vi + h 
= Vi-Vi + h 
< Vi - Vi + h 



2ui 



2ui 



-+e2{ih)] +0(/i2) 

+ e2{ih)\ +0{h^) 

+ e2{ih)] +0{h^) 



Ui + Vi Ui+ u 

2ui{vi - Vj) - 2vi{ui - Uj) 

(iii + Vi)(ui + Vi) 
2{ui + {ij)max{|z;i - Vi\,\ui - Ui\] 



(207) 



{Ui +Vi){Ui +Vi) 

< 6*Fi+eiih/n)n + 26*h {Fi{ih/n)/p + 1) + O^h^) 
= d*{Fi+s{ih/n)n + hF[{ih/n)) + 0{h^) 

= 6*{Fi+eiii + l)h/n)n + h{Fi{ih/n) - F(+,((i + e)h/n))) + 0{h^) 

< S*{Fi+e{{i + l)h/n)7i - n{£h), 

completing the induction. 

The remaining three cases are proved similarly. This completes the inductive proof of p06p . Letting 
e — >• we see for example that y{t) — y{t) < 6*Fi{t) for t < Ti. This completes the proof of (|204|) . 

Let 

Qo = Fi{f/n). 

Observe next that 



(y + z)' 



y 



y + z 



1 <o. 



So for t < Ti we have 

y + z> y{f) + ~z{f) = ~z{f) = 2{n - f) = {^ + .01)n 
Furthermore, putting X = 1 and going back to (jl86p . 

Qparctanx 



A(r) = ii±4^L^ I 2c 



x=0 (1 + x)Vl + x2 



dx = a\c — a2- 



(208) 



(209) 



(210) 



8.0.5 Lower bounding A 

We now show that A — A is small. We now use (j20ip and ()204p to write for t <Ti, 



l/i' - A'l 



< l^sl + 



2flz{{y + zY + 45*Fi{t/n){y + z)n + 4(5*^Fi(i/n)^n^) - 2jX{y + zY{z - 8*Fi{t/n)n) 



h\ + 



Now, using ([2031) . 



{y + zY{y + zY 
2z{y + zfifi -il) + 26*Fi{t/n)n{y + z){Afiz + fl{y + z)) + 8fiz6*'^Fi{t/nYn^ 



{y + zy{y + zy 



4fiz + jj,{y + z) 4:11 + jl 5c 



{y + z){y + zY {y + zY P'^n 
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and 



8fiz 8c 



So, 

pn 

where 

_ lOapc 8cal6* {2c +1)6* 
"^ ~ ^2- + —^3— + /3(i_5*)2' 

where the third term is an upper bound for £10, £11 and its vahdity rests on (I196P and (|203p . with 
which we bound A < 2c//3. 

Integrating, we get that if C = 1^-/^1 then 

C'-^< a36* 
pn 

and so 

l/i' - A'l < a3<J*e2*//^" /* e-2-//^"(iT = a^d^^ie^'/"'' - 1) < a^6*n. 
Jt=o 2 

for t <Ti, where 

It then fohows that as long as i < Ti, 

2fi{y + z) - 2ii{y + z) 



A - A = -04 + 

<e9 + 



{y + z){y + z) 
2{jl + ai5*n){y + z) - 2jl{y + z - 2aQ5*n) 



{y + z){y + z) 
2a4(5* 4cQo5* 

It folfows from (jl85|) that for t <Ti we have 

A(t) > A(ri) - a5(5* (211) 

where 

2c 2q4 4cao 

"^ = ^ + — + ^- 

We now argue that y{Ti) = and A(ri) > A*. This proves the LemmalHTH since y{Ti)+z{Ti) > Pn. 
Suppose then to the contrary that yiTi) > 0. Recall that Ti < T (see ()192p ) and suppose first that 
Ti < f. Now let 

T2 = min JTi + en, (Ti + f)/2^ 

where < e < 10"^*^ is such that 

maxJTG [Ti,r2]:emax{|A'(T)|,|y'(r)|,|i'(T)|}<10-^°}. (212) 

The existence of such an e follows by elementary propositions in real analysis. 
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We will argue that r E [Ti,T2] implies 

A(t) > A* and min{y(r) + z{T),y{T) + z{t)} > j3n and min{y(r),i(T),y(r),i(T)} > 0, 

which contradicts the definition of Ti . 

Fix r € [Ti,T2]. Now t <T implies that y(r) > 0. Together with (jl79p we see that i increases for 
i < T and hence z{t) > 0. We have y'{t) > -2 (see (fTTHD l and 5'(t) > for t < T2 (see (fTTO]) ) and 
so for some ti,T2 G [Ti,r2] 

y(T) = y{Ti) + (r - ri)y'(ri) > y{Ti) - 2en. 
z{t) = z{Ti) + (r - Ti)z'{t2) > z{Ti). 

It follows (using (I2MD) that 

y(.T) + z(t) > y(To) + z{To) - 2en > y{f) + z{f) - 2en > /3n. (213) 

We have, for some r3,r4 G [Ti,T2], 

y(r) + z(t) = y(Ti) + z{Ti) + (r - ri)(y(r3) + zir^))' 

> y{Ti) + 5(ri) - 2Fi{f/n)5*n - 2 x 10^^°n 

> (^ + .01 - 2 (ao<5* + 10"^°))) 71 

> pn. (214) 

We now argue that z{t) > 0. Equation (jl79p shows that z is strictly increasing initially. Also, if 
A > A* then ^3 < 1/8. From (|200p we see that z is strictly increasing at least until a time tq when 
y(7"o) ^ /3<J*- On the other hand, we see from (j214p that if ^(t) < 135* then z(r) > 0. So, 



min{y(r),i(T),y(r),z(T)}>0. (215) 



Now we write 



A(r) = A(ri) + (r - ri)A'(r3) 

> A(Ti) - (A(ri) - A(ri)) - 10-1°, using (EH]), 
>aic-a2-a55* -10"^°. 

> A*. (216) 

We must now deal with the case where Ti = T. Here we can just use (j204p to argue that z{Ti) > 
z{Ti) - ao6*n > and y{Ti) + z{Ti) > y{Ti) + z{Ti) - ao5*n > (/3 + .01 - Qo5*)n > /3n and 
A(r) > \{f)-a^8* > X*. 

This completes the proof of Lemma 18.11 D 

It follows from LemmaOthat w.h.p. y{Ti) < n^/'^ , z{Ti) > /3n - n^/^ and A(ri) > A*. We claim 
that q.s., y becomes zero within the next v = v?!^^ steps of 2greedy. Suppose not. It follows 
from Lemma 16.51 that A changes by o(l) and by ([6]) that z changes by o{n) during these v steps. 
Thus Ti + z/ < Tq. It follows from (j93|) that q.s. at least z/log"^ n of these steps will be of type Step 
2. But each such step reduces y by at least one, contradiction. 

This verifies (fTOOll . 
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9 The number of components in the output of the algorithm 



We will tighten our bound on (^ from Lemma |6.6[ 

Lemma 9.1. If c> 15 then for every positive constant K there exists a constant ci = ci{K^ such 
that 

P (31 < t < Ti : C(i) > C2 log n) < n~^ . 



Proof We now need to use a sharper inequality than (j9ip to replace Li by what is claimed in 

the statement of the lemma. This sharper inequality uses higher moments of the Xt's and we can 
estimate them now that we have the estimate of the maximum of ((t) given in (j9ip . So, we now 
have to estimate terms of the form 

for ^ = yi, 2/2, -Z2, 2 < j < log n and ry = v or b, d. 

We use the inequality 

{a + b + c + dy < 4^{\a\^ + \b\^ + \c\^ + \d\^) 

for j > 1. 

We will also need to estimate, for 2 < j < logn, 

^ — k\ — = ^ ^ (A. -2)! <^^ Z^^r = ^'^ 2^2^|4^r 

^'^' t {'>' E 7]^ S ^'^'^'^ t {i} S 2'i!A^«.\ 



£=0 ^ '' k>e ' ' £=0 



Here j-^} is a Stirling number of the second kind and it is easy to verify by induction on j that the 
Bell number ^21=0 {1} — J^-- 
Step 1. yi + y2 + zi > 0. 

Step 1(a). yi > 0. 

*i(!/.|v) = o(2«AVj!(^ + !^)). (218) 

*,(?/2||v|) = o(23^A^e^j!(A3 + ^ + i^)). (220) 
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*feib,d) < 4i||i+i:^(*-i)'|+E|^(*-i)'|^+Bffl: 



2" fe-j^^ 



fe>2 



*(.H|v|) ^ 0(2«AV,,(A^4.i^)) 



Step 1(b). yi = 0,y2>0. 



(221) 

(222) 
(223) 



kzk 



,-2y2 



*.(yi|b,d) < 4^' 2E^(^-1)^|: + 12M]1 



(224) 
(225) 



2y2 



fcZfc 



,2^2 



fcZfc 



,-3y3 



*,fe|b,d) < 4M-f + 2|:^,*-lF|i + 2|:^,*-lF|i + ^|,(226) 



*,fe||v|) ^ 0(2«AVil(A=4.i^)). 



(227) 



/cZfe 



2;i 



fcZfc 



,-2z2 



*,(.,|b,d) < 4^K7 + 2Ei7(^-l)^i;; + 2Ei7(^-l)^i? + l228] • (228) 



fJ' 



k>2 



2ll 



2ii 



k>2 



2^l 



2/i 



^j{zi I |v| 



o{^"^'H''4^'^)) 



(229) 



Step 1(c). yi=y2= 0, Zi > 0. 

^j(yi|b,d) = £[23l} 
*i(?/i I |v|) = £ |232} 



(230) 

(231) 
(232) 



^i(y2 I b,d) 

*j(y2 1 |v|) 



^ ^'(g|f(^-v|f+^ 



O h^-JA^e^i! A3 + ^ + 



C log2 N 



N XN 



(233) 
(234) 



43 



*,(.jb,d) < 4qii+j:|i(.-iy|+j:|i,.-i)>|i+^). (235) 



2/i 



Now let Et = {C{t) < log^ 7i:l<T<t}. Then let 

' {c{i + 1) - ai))m) o<i<n 



(236) 



Yi 



-ci/2 



Ti < i < n 



Then, q.s. 



y,+i + . . . + y* = c(t) - C(s) for < s < t < Ti. 



For some absolute constant C2, and with 6 



Cl 



100ce^='=+i(3ce) 



TTo-T^- and i < Li, 



E[e'^'+^ \Ys+i,...,Ys 



s+i-~l\ 



J^G'^E 



k=0 



\^k 
^ s+i 



k\ 



y 



s+li • • • 1 ^ s+i—1 



y- 



< 1 - eci/2 + C2 X] 6H^^\{J?)''+'^e^^^ < e-^"l/^ 



fc=2 



where we have used (j81|) and we have used Lemma 16.31 to bound X{i). 
It follows that for t — s < Li and real u > 

p(y,+i + • • • + y* > u) < e-^(^+'=i(*-*)/3) 

Suppose now that there exists t < Tq such that (^(r) > L2. Now q.s. there exists ti < t < ti + Li 
such that C(^i) = 0- But then putting u = — logn and L2 = — °^" we see that given ti, 



Cl 



P(3ii < r < ti + Li : C(t) > L2) < 



'^^7 ' 



?(ciI/2/3-logn) ^ 7),-^ 



D 



We get a new path for every increase in Vqj-, J < 1. If we look at equations (fT2|) etc., then we see 
that the expected number added to Vb,j at step t is 0{C{t)/n{t)). So if Zp{t) is the number of 
increases at time t and Zp = Xldo ^p{^)^ where T3 is the time at the beginning of Step 3, then 



E[Zp] = OE 



;""S^) 



O log n E 



log 



( m 

Urn) 



(237) 



Now in our case fJ-iT^) = Q(n) with probability 1 — o(n~^) in which case E[Zp] = O(logn). We 
will apply the Chebyshev inequality to show concentration around the mean. We will condition 
on ||u(t) — u(t)||i < n^'^ for t < Ti (see Lemma l7.ip . With this conditioning, the expected value 
of Zp{t) is determined up to a factor 1 — 0(ri~^'^log n) by the value of u(t). In which case, 
¥.[Zp{t) I Zp{s)\ = (1 + o(l))E[Zp(f)] and we can apply the Chebychev inequality to show that 
w.h.p. Zp = O(logri). We combine this with Lemma 15.41 to obtain Theorem[TJ 
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10 Hamilton cycles 

We will now show how we can use Theoreni[T](^a) to prove the existence and construction of Hamilton 
cycles. We will first need to remove a few random edges X from G = G^^ in such a way that the 
pair {G — X, X) is distributed very close to {H = G^-^^_,-^,, Y) where F is a random set of edges 
disjoint from E{H). In which case we can apply Theorem [1] to H and then we can use the edges of 
Y to close cycles in the extension-rotation procedure. 

10.1 Removing a random set of edges 

Let 



and let 



n ' log n 



n={iH,Y):HG gi^^_,,Y C ( ^^^ ) , |y| = s and E{H) n Y 



where G^-^ = IC^-^l 

We consider two ways of randomly choosing an element of O. 

(a) First choose G uniformly from G^^ and then choose an s-set X uniformly from E{G) \E^{G), 
where E^{G) is the set of edges of G that are incident with a vertex of degree 3. This produces 
a pair {G — X,X). We let P^ denote the induced probability measure on il. 

(b) Choose H uniformly from Gn~cn-s ^^"^ then choose an s-set Y uniformly from (^2 ) \ ^i^)- 
This produces a pair {H,Y). We let Pf, denote the induced probability measure on fi. 

The following lemma implies that as far as properties that happen w.h.p. in G, we can use Method 
(b), just as well as Method (a) to generate our pair {H,Y). 

Lemma 10.1. There exists Qi C Q such that 

(i) P,(Oi) = 1-0(1). 
(ii) uj = {H,Y) G ill implies that Fa{uj) = (1 + o(l))Pfe(a;). 

Proof We first compute the expectation of the number fi^ = /^3(G) of edges incident to a 

vertex of degree 3 in G chosen uniformly from Gncn- ^^ ^'^^ ^^^ ^^^ random sequence model of 
Section [3l We will show that /i3 is highly concentrated in this model and then we can transfer this 
result to our graph model. Observe first that if 1^3 is the number of vertices of degree 3 in Gx then 
Lemma 13.31 implies that 

3!/3iAj 



Here A is the solution to A/2(A)//3(A) = 2cn. 

To see how many edges are incident to these v^ vertices we consider the following experiment: 

3! /3(A) • 



Condition on 1/3 = pn where p will be taken to be close to p3 = wtjx\- ^^ take a random 



45 



permutation vr of [2c?i] and compute the number Z of i < en such that {Tr{2i — 1), 7r(2i)} n [Sz^a] 7^ 0. 
This win give us the number of edges in Gx that are incident with a vertex of degree 3. Now 

^ r 1 / 2cn — on 2cn — on — 2\ , , ^, , ^-. 

E[Z] = cn[\- -^ ^J_^ J = cn(2p - p^ ^ 0(l/n)). 

Now interchanging two positions in vr can change Z by at most one and so applying the Azuma- 
Hoeffding inequality for permutations (see for example Lemma 11 of Frieze and Pittel [15j or Section 
3.2 of McDiarmid [IB]) we see that P(|Z - E[Z]| > n) < e-"'/('=") for any n > 0. Putting this aU 
together we see that 

P(|/i3(G) - p3(2 - P3)cn| ■>u)< e-"'/'=^ 

Now let 

^Sn = {C G Cn ■■ l/^3(G) - P3(2 " Pz)cn\ < n^^iogn} 

and 

This satisfies requirement (a) of the lemma. 
Suppose next that w G ri^- Then 

p .^N ^ _1 1 = l + 0(log-^n) 

"^ '^ I 5>3| /cn(l-p3)2±nl/2lognA |r'5>3| lcn{\-pzY\ ^'^°' 

|yn,cn| I, s J |yn,cn| " i^ ^ y 

'^^(-) = ^^ • TGP^ (239) 

l='n,cn— si I, s j 

One can see from this that one has to estimate the ratio |^„7:nl/l^n~cn-sl • ^'^'^ ^^^^ ^^ make estimates 
of 

M = I {(Gi,G2) G gSn X Ci-s : KGx) 5 i?(G2)} |. 

We have the following inequalities: 

IC'5>3Ycn(l-p3)2-ni/2logn\ ^5>3 /cn(l - ps)^ + n^/^ logn 

iCil E (^^(^-f)'+^)e~'^Vcn (240) 

|«|>n-'-/2 logn 

M = \Q'^lJ^^-'\ (241) 



We get (12401) by summing fi^^Gi) over Gi G ^n^n ^^^ bounding /i3(Gi) according to whether or 
not G is in GrTcn- Equation (I24ip is obtained by summing over G2 G ^„7.,„_s, the number of ways 
of adding s edges to G2 ■ 

Now 



'cn(l - psf + U\ ^_„2/,„ ^2 y^ /cn(l - P3)2^| ^oius/n)^-uycn 

\u\>n^/'^logn u>n^/^logn 
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u>n^^'^ logn 



It follows from this and (j240p that 



M = |aS^ir('7^n(l + 0(log-n)). 



By comparing with (j24ip we see that 



'^"''^"' = fl + o(l)) ^ ' ' 

\^n,cn—s\ \ s ) 



The lemma follows by using this in conjunction with (|238p and (|239p . D 

10.2 Connectivity of G^|3 

Lemma 10.2. G^-^^ is connected, w.h.p.. 

Proof It follows from Lemma flO. II that we can replace Gf^^ by G^^^^g plus s random edges. 

We use the random sequence model to deal with G„-„_j,. Let Fix 4 < k < n/ log n. For K C [n], 
e{K) denotes the number of edges of Gx contained in K. Let io = logn/(loglogn)^'^. Then with 
A the solution to A/2 (A) //3(A) = 2c, 



{3K<Z[n]:e{K)>5k/4)<o{l)+SJ\ > . ^TTTTTtU, / J " • (242) 




^ W / cn\ /d^'"/' 

Explanation of (I242p : Here 5^ = 1 + o(l) for k < log n and 0{n}''^) for larger k. The term 
,^r f^^k bounds the probability that the total degree of K is d, see ([5^ . Given the degree sequence 
we take a random permutation vr of the multi-set {dx(j) x J : j £ M} and bound the probability 
that there is a set of 5A;/4 indices i such that 7r(2i — l),7r(2i) G K. This expression assumes that 
vertex degrees are independent random variables. We can always inflate the estimate by 0(n^'^) 
to account for the degree sum being fixed. This is what 6^ does for k > log n. For smaller k we 
use (l3|). The bound of d < iQk arises from Lemma l3.2l fb). 

Let cj/fc denote the RHS of (j243p . Then, we have "^^L^^ " crfc = o{l). 

But if no G has minimum degree at least 3 and K contains at most 5|ir|/4 edges then there must 
be edges with one end in K. So, we see that w.h.p. the minimum component size in G will be at 
least n/log n. We now use the result of Section flO-ll If we take H = G„-„_^, s = n^'^ log" n 
then we know by the above that w.h.p. it only has components of size at least n/log n. Now add 
s random edges Y. Then 

¥{H + y is not connected) = o(l) + log"^" n ( 1 j^— ) = o(l). 

V log n J 

Now apply Lemma llO. 11 D 
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10.3 Extension-Rotation Argument 

We will as in Section riU.2l replace Gf^^ by G„-„_g plus s random edges Y. Having run 2greedy 
we will w.h.p. have a two matching Mq say such that Mq has O(logn) components. 

The main idea now of course is that of a rotation. Given a path P = {ui,U2, . . . ,Uk and an edge 
{uk,Ui) where i < k — 2 we say that the path P' = (ui, . . . ,Ui,Uk,Uk-i, ■ ■ ■ ,Ui-^.i) is obtained 
from P by a rotation, ui is the fixed endpoint of this rotation. We now describe an algorithm, 
EXTEND-ROTATE that W.h.p. converts Mq into a Hamilton cycle in 0(n^'^+°(^)) time. 

Given a path P with endpoints a, b we define a restricted rotation search RRS{i') as follows: Suppose 
that we have a path P with endpoints o, b. We start by doing a sequence of rotations with a as the 
fixed endpoint. Furthermore 

Rl We only do a rotation if the endpoint of the path created is not an endpoint of the paths that 
have been created so far. 

R2 We stop this process when we have either (i) created v endpoints or (ii) we have found a path 
Q with an endpoint that is outside Q. We say that we have found an extension. 

Let END{a) be the set of endpoints, other than a, produced by this procedure. The main result 
of [U] is that w.h.p. that regardless of our choice of path P, either (i) we find an extension or (ii) 
we are able to generate 'n}~°y^> endpoints. We will run this procedure with v = n^'^ log n. 

Assuming that we did not find an extension and having constructed END{a), we take each x € 
END{a) in turn and starting with the path Px that we have found from a to x, we carry out R1,R2 
above with x as the fixed endpoint and either find an extension or create a set of u paths with x 
as one endpoint and the other endpoints comprising a set END{x) of size u. 

It follows from [2] that the above construction RSS{y) can be carried out in 0(z/^ log n) time. 

Algorithm EXTEND-ROTATE 

Step 1 Choose a path component P of the current 2-matching M, with endpoints o, b. 

If there are no such components and M is not a Hamilton cycle, choose a cycle C of M and 
delete an edge to create P: 

Step 2 Carry out RSS^u) until either an extension is found or we have constructed v + 1 endpoint 
sets. 

Case a: We find an extension. Suppose that we construct a path Q with endpoints x, y 
such that y has a neighbour z ^ Q. 

(i) If z lies in a cycle C then let P be a path obtained from C by deleting one of the 
edges of C incident with z. Let now P = x,Q,y,z,R and go to Step 1. 

(ii) If z = Uj lies on a path R = {ui,U2, . . . , u^) where the numbering is chosen so that 
j > k/2 then we let P = x,Q, y, z, Uj-i, . . . ,ui and go to Step 1. 

Case b: If there is no extension then we search for an edge e = {p,q) £ Y such that 
p E END(a) and q £ END{p). if there is no such edge then the algorithm fails. If 
there is such an edge, consider the cycle P + e. Now either C is a Hamilton cycle and 
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we are done, or else there is a vertex u £ C and a vertex v ^ C such that (u, v) is an 
edge of H. Assuming that H is connected, see Lemma ll0.2[ We now delete one of the 
edges, (u, w) say, of C incident with u to create a path Q from w to n and treat e as 
an extension of this path. We can now proceed as Case a. 

10.3.1 Analysis of EXTEND-ROTATE 

We first bound the number of executions of RSS{iy). Suppose that Mq has k < Ki log n components 
for some Ki > 0. Each time we execute Step 2, we either reduce the number of components by one 
or we halve the size of one of the components not on the current path. So if the component sizes 
of Mq are ni, n2, . . . , n^ then the number of executions of Step 2 can be bounded by 



K + y^ log2 Ui < K + K log2(n/K) = 0(log U log log 77.) 



i=l 



(Re-call that log(a6) < 21og((a + b)/2) for a, 6 > and you will see the first inequality here). 

An execution of Step 2 takes 0(i^^ log n) time and so we are within the time bound claimed by 
Theorem [2j 

We first argue that extend-rotate succeeds w.h.p.. Suppose that the edges of Y are ei, 62, . . . , e^. 
We can allow the algorithm to access these edges in order, never going back to a previously examined 



edge. The probably that an e,, can be used in Case b is always at least 



a)- 



G) 



- &7? (^^ ^^^^ 
subtracted s because some of the useful edges might have been seen before the current edge in the 
order). So the probability of failure is bounded by the probability that the binomial Bin ( s, °^^y" 
is less than K2 log n log log n for some K2 > 0. And this tends to zero. This completes the proof of 
Theorem [2j 



11 Concluding remarks 



The main open question concerns what happens when c < 15. Is it true that (jl93p holds all the 
way down to c > 3/2? We have done some numerical experiments and here are some results from 
these experiments: 



c 


y final 


^ final 


l^ final 


^final 


3.0 


0.000008 


0.283721 


0.398527 


1.822428 


2.9 


0.000009 


0.242563 


0.326139 


1.602749 


2.8 


0.000010 


0.197461 


0.253645 


1.370798 


2.7 


0.000010 


0.148901 


0.182327 


1.123928 


2.6 


0.000010 


0.098344 


0.114494 


0.858355 


2.5 


0.000010 


0.048976 


0.054010 


0.565840 



These are the results of running Euler's method with step length 10~^ on the sliding trajectory 
()153p . They indicate that (I193p holds down to somewhere close to 2.5. This would indicate some 
sort of phase transition in the performance of 2GREEDY at around this point. There is one for the 
Karp-Sipser matching algorithm and so we are led to conjecture there is one here too. 
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Can we prove anything for c < 15? At the moment we can not even show that at the completion 
of 2GREEDY the 2-matching M has o{n) components. This will be the subject of further research. 

Finally, we mention once again, the possible use of the ideas of [T^ to reduce the running time of 
our Hamilton cycle algorithm to 0{'n}^°^^') time. 

Our list of problems/conjectures arising from this research can thus be summarised: 

(a) Find a threshold ci such that 2greedy produces a 2-matching in Gf^^ with O(logn) compo- 
nents w.h.p. iff c > ci . 

(b) If ci > 3/2 then show that when c G (3/2, ci), the number of components in the 2-matching 
produced is 0(n°) for some constant a < I. 

(c) Analyse the performance of 2greedy on the random graph Gn,cn i-e. do not condition on 
degree at least three. Is there a threshold C2 such that if c < C2 then w.h.p. only Steps la,lb,lc 
are needed, making the matching produced optimal. 

(d) Can 2GREEDY be used to find a Hamilton cycle w.h.p. in 0{n^^°^^') time when applied to 
^ncn ^^d '^ sufficiently large? 

(e) How much of this can be extended to find edge disjoint Hamilton cycles in G^^Jj for k > A. 
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A Proof of (E 



To find a sharp estimate for the probabilities in ([3|) we have to refine a bit the proof of the local 
limit theorem, since in our case the variance of the Zj are not always bounded away from zero. 
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However it is enough to consider the case where Na'^ — )■ oo. There is httle loss of generahty in 
assuming that D = Q here. As usual, we start with the inversion formula 

P I f; Z, = r j = i- r e— -'E [e'-^U ^.) dx 



(244) 



where r = 2M — k. Consider first \x\ > (A^A) ^/^'^. Using an inequality (see Pittel |20j ) 

mv)\ < e(^'=^-H)/(^+i)/,(|r?|), 



we estimate 



2vr L 



|x|>(AfA)-5/i2 



-^r.,^ //Ke^^A)^^^ 



n 



hW 



i=2 
^NX(cosx-l)/A ^^ 

< gJVA[(cos((JVA)-5/i2)_l)/4] 

< g-{JVA)i/6/9. 



dx 



1 
< — 

2vr -/|a;|>{ArA)--Vi2 



(245) 



For \x\ < (NX) 5/ii^ putting rj = Ae*^ and using 



^ ^wp ^ 2^ ^^^ ^^^^ ^ .^^^^^ 



£=2 



fiW 



we expand X]£=2 -^^ ^°§ 



/HA) 



as a Taylor series around x = to obtain 



i=2 ^ J^^^> ^ ^ \l=2 



vfM 



v=x 



-^pME^-'^^^'^ 



,1=2 



Mv) 



v=x 



+Ox4p3 ^iV, 



v£=2 



fiiv) 



(246) 



v=v/ 



Here f/ = Ae*^, with 5; being between and x, and P = r]{d/dr]). Now, the coefficients of x^/2, x^/S! 
and x^ are Na'^, 0{Na'^), 0{Na'^) respectively, and a^ is of order A. (Use ([2]) and consider the 
effect of D on a power of r/.) So the second and the third terms in (1246P are o(l) uniformly for 
|x| < [NX)-^/'^'^. Therefore 

2vr J|x|<(ArA)-5/l2 Ji J2 Ji 



52 



where 



27r 



e*'=-^-'-'/2 dx 



|x|<(7VA)-5/i2 



1 / fc^ + 1 \ 



i - ''{^iP' 



A/;(A) 



A(A) y y|a-|<{ArA)-5/i2 
V i|x|<(ArA)-5/i2 

(a > is an absolute constant), and 

3 \ J|a;|<(ArA)-Vi2 y 

O 



2 I I iVA 



^3g-iV.^xV2^^ 



Using (p4i|) - (f250]) . we arrive at 






(248) 



(249) 



(250) 



X 



X 2 



B Mathematica Output 

In the computations below, £p{X) is represented by ep[x] and Op is represented by Ap and /3 is 
represented by B. The computation Ci is the justification for (I216p . 

fO[x_]:=Exp[x] 

fl[x_]:=fO[x] - 

f2[x_]:=fl[a;] - 

f3[x_]:=f2[;c] - 

el[x-]:=i| 
e2[x-]:=g| 

e3[x-]:=§|t 

e4[x-]:=T^ 

Yf 1 e4[x]+e5[a!]+e6[a] 

eqx_j. i_e5fa:] 
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e9[x_]:=xe4[a;] 

d[x_]:=Max[el[a:], e2[x], e3[a;], e4[x], e5[x], e6[a;], e7[x], e8[x]] 

0.000102752 

T =i- 2i/2Exp[Pi/4] 

B = -.01 + 2(1 - T) 

0.634794 

AO = B(Exp[2T/S] - 1) 

4.73302 

1.5312 

A2 = iV [lllIl^b^Mlntegrate [fgj^^iS, {., 0, 1}] ] 

1.41846 

A4[c_, x_]:=A0A3[c, x]/2 
A5[c_,x_]:=f + ^^ + ^ 
Cl[c_, x_]:=Alc - A2 - A5[c, x]d[a;] 
iV[Cl[15,16]] 
20.1217 
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